An Experimental Investigation of Wide-Angle 
Sidelobe Suppression in a Pyramidal Horn- 
Reflector Antenna 


by C. A. SILLER, JR.* 
(Manuscript received September 22, 1983) 


An experimental investigation into the suppression of wide-angle radiation 
from a pyramidal horn-reflector antenna is reported in this study. We show 
that lining the sidewalls or diffracting edges of this antenna with microwave 
absorber in the region of the visible aperture can lead to a marked reduction 
in sidelobe energy when the antenna is transversely polarized. Specifically, 
sidelobe reductions of as much as 10 dB are attainable over angular regions 
40 degrees or more from the main beam. Though the described absorber 
treatment reduces wide-angle radiation, it can simultaneously increase and 
distinctly modify the character of near-in sidelobe structure. Data reported in 
this study are related to those of a recently published paper on selective near- 
in sidelobe reduction for a pyramidal horn-reflector antenna. 


I. INTRODUCTION 


The judicious application of microwave absorber as a method to 
modify reflector antenna sidelobe levels is a well-known and powerful 
technique. The approach is especially useful in the design of large- 
aperture antennas for terrestrial radio communications, where RF 
interference into or from other radio links is of paramount and 
increasing concern. 

In a recent paper, R. A. Semplak¹ presented data from an experimental 
investigation of selective near-in sidelobe reduction of a pyramidal 
horn-reflector antenna. That work is especially important for 
those interference situations occurring at small angles from the an- 
tenna beam, where the antenna discrimination is naturally less than 
that at large angles. On the other hand, wide-angle antenna radiation 
often limits the number of converging routes into a microwave radio 
station, a potentially acute situation at metropolitan junctions. For 
the pyramidal horn-reflector antenna, this latter problem was amelio- 
rated for longitudinal polarization* by modifying radome attachment 
methods? and for transverse polarization by attaching multiple-edge 
blinders to the sidewalls of the antenna.*” This paper is related to 
Semplak’s, in that we also report the results of an experimental 
investigation into sidelobe suppression in a pyramidal horn-reflector. 
Unlike Semplak’s investigation, however, our focus is on broadband, 
wide-angle sidelobe suppression. We show, as Semplak did, that side- 
lobe suppression is dependent on where the absorber is specifically 
situated, some locations being preferable to others. Additionally, our 
findings show that prior analytic studies do not completely model 
radiation from this antenna, since they fail to predict its measured 
radiation patterns in full. 

Pyramidal horn-reflector antennas, one generic type in a family of 
horn-reflectors, are of considerable practical significance.® They are 
widely deployed throughout the world, with domestic use alone 
exceeding 10,000. In Semplak’s study a scaled-down precision 
antenna was measured at 30 GHz. For this study we used a precision, 
fiberglass antenna specifically designed for line-of-sight operation in 
the 4-, 6-, and 11-GHz common carrier bands.’ This antenna stands 
approximately 16 feet high, has a 70-inch offset paraboloid illuminated 
by a 29-degree radial pyramidal horn, and is equipped (for a portion 
of this study) with 17-edge blinders.® A photograph of this antenna 
appears in Fig. 1. Though this fiberglass antenna has an aperture area 
only 60 percent that of the standard pyramidal antenna used in the 
AT&T Communications network (with a 90-inch focal length), it 
affords uncommonly good wide-angle sidelobe suppression. Indeed, in 
some cases this antenna provides far-out sidelobe levels lower than 
that of the larger pyramidal horn-reflector. These last observations 
are noteworthy since we demonstrate in this paper that the wide-angle 
radiation can be further suppressed by as much as 10 dB using absorber 
on the sidewalls of the antenna aperture. Unfortunately, this absorber 


* We use the conventional definitions for longitudinal and transverse polarizations 
for this antenna? For terrestrial applications, the pyramidal horn axis is perpendicular 
to the local plane of the earth, and longitudinal polarization corresponds to aperture 
electric fields perpendicular to that horizontal plane, while transverse polarization 
corresponds to electric fields parallel to the earth’s surface. All patterns are measured 
in the transverse plane, an azimuthal plane parallel to the earth’s surface. 
Fig. 1—Fiberglass, pyramidal, horn-reflector antenna equipped with multiple-edge 
blinders. 
treatment simultaneously increases the near-in radiation lobes. It may 
well be that a combination of Semplak’s methods, with selected 
microwave absorber placement like that described herein, could sup- 
press sidelobe energy throughout major regions of the antenna’s azi- 
muthal radiation plane. 


II. EXPERIMENTAL MEASUREMENTS AND INTERPRETATION 


The experimental measurements reported in this study were carried 
out in two phases. In the first phase the antenna was equipped with 
17-edge blinders, and the interior sidewalls in the region of the visible 
aperture were covered with 2-inch-thick, hair-type absorber. This 
absorber has a near-normal, specular power reflection coefficient less 
than 0.01 at 4 GHz, with an even smaller reflection at the higher 
frequencies used for part of this study. Absorber installation in the 
antenna is schematically illustrated in Fig. 2, where the multiple-edge 
blinders are omitted for clarity. In the second phase of the study, the 
blinders were removed, and a 3.5-inch strip of 2-inch-thick absorber 
was recessed and placed adjacent to the side diffracting edges. This 
situation, illustrated in Fig. 3, will be deferred for subsequent discus- 
sion. 


2.1 Absorber-lined antenna interior 


For this first phase of the investigation, complete principal and 
cross-polarized patterns were measured at a single frequency in each 







Fig. 2—Absorber installation on sidewalls of antenna. Edge blinders omitted for 
clarity. 


534 TECHNICAL JOURNAL, APRIL 1984 


of the 4-, 6-, and 11-GHz common carrier bands. Typical radiation 
pattern measurements, with and without absorber modification, are 
presented in Figs. 4 and 5, respectively. Complete 360-degree azimuthal 
plane patterns are measured in two stages. For radiation near the 
main beam, a 30-dB RF attenuator is used to limit power into the RF 
receiver. In the vicinity of 10 degrees, where the power is significantly 
lower, the attenuator is removed and the remainder of the pattern is 
measured. Patterns like those in Figs. 4 and 5, therefore, appear 








Fig. 3—Absorber installation along exterior, recessed, side diffracting edges. Edge 
blinders omitted for this portion of study. 
Fig. 4—Radiation pattern of antenna equipped with sidewall absorber and multiple- 
edge blinders. Transverse polarization, transverse (azimuthal) measurement plane; 3.95 
GHz. (Note 30-dB change in ordinate scale near ±10 degrees.) 




Fig. 5—Radiation pattern of antenna equipped with multiple-edge blinders but without 
absorber. Transverse polarization, transverse measurement plane; 3.95 GHz. (Note 
30-dB change in ordinate scale near ±10 degrees.) [Axes: relative amplitude in 
decibels versus angle in degrees, from -180 to 180.] 


discontinuous with a corresponding 30-dB change in the ordinate. 
Because the absorber differentially affects near-in and wide-angle 
radiation sidelobes, results are most conveniently presented in sepa- 
rate subsections. 
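
The two-stage measurement described above can be sketched in a few lines. Readings taken with the 30-dB attenuator in line are raised by 30 dB so that they share a reference level with readings taken after the attenuator is removed. The function name, the switch angle passed as a parameter, and the sample readings are illustrative assumptions, not data or procedures taken from this study.

```python
def stitch_pattern(angles_deg, readings_db, switch_angle_deg=10.0, attenuator_db=30.0):
    """Refer all receiver readings to a common level.

    Readings within +/- switch_angle_deg of the main beam are assumed to have
    been taken with the attenuator in line, so they are raised by attenuator_db.
    Readings outside that region are returned unchanged.
    """
    corrected = []
    for angle, level in zip(angles_deg, readings_db):
        if abs(angle) <= switch_angle_deg:
            corrected.append(level + attenuator_db)
        else:
            corrected.append(level)
    return corrected


# Hypothetical receiver readings in dB (not measured data).
angles = [-20.0, -5.0, 0.0, 5.0, 20.0]
raw = [-45.0, -50.0, -30.0, -50.0, -45.0]
print(stitch_pattern(angles, raw))
```

The plotted patterns in this paper keep the 30-dB step instead; the sketch only shows how the two segments relate to one another.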


2.1.1 Absorber-lined antenna interior: effect on near-in sidelobes 


Figures 6 and 7 present typical near-in sidelobe performance for 
3.95-GHz longitudinal and transverse polarization. These figures also 
include comparative patterns for the antenna without absorber. The 
presence of absorber has little effect for longitudinal polarization (Fig. 
6), while it notably raises the sidelobe levels and changes the character 
of sidelobe structure for transverse polarization (Fig. 7). 

It will be remembered that the projected aperture fields for this 
antenna correspond closely to the illumination provided by dominant 
mode square waveguide. For longitudinal polarization, aperture fields 
are tangent to, and therefore vanish at, the antenna sides. Thus, for 
this polarization state, the absorber leaves the aperture fields 
essentially unperturbed and hardly affects the near-in pattern, apart 
from a very slight decrease in aperture width (approximately 5.4 percent). 
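
As a rough illustration of why the two polarizations start from different sidelobe levels, the sketch below compares two idealized line-source apertures: one with uniform illumination across the width (fields that do not vanish at the sidewalls) and one with cosine illumination (fields that vanish at the sidewalls). The one-dimensional model, the function names, and the sampling values are assumptions made for illustration; this is not the aperture model used in the study.

```python
import math


def pattern(taper, u, n=200):
    """|F(u)| for a symmetric taper on [-1/2, 1/2], by the midpoint rule.

    u is the aperture width in wavelengths times the sine of the pattern angle.
    """
    dx = 1.0 / n
    total = 0.0
    for k in range(n):
        x = -0.5 + (k + 0.5) * dx
        total += taper(x) * math.cos(2.0 * math.pi * u * x) * dx
    return abs(total)


def first_sidelobe_db(taper, u_max=3.0, steps=1500):
    """Level of the first sidelobe peak relative to the main-beam peak, in dB."""
    mags = [pattern(taper, u_max * i / steps) for i in range(steps + 1)]
    i = 1
    while mags[i] <= mags[i - 1]:   # walk down the main lobe to the first null
        i += 1
    while mags[i] >= mags[i - 1]:   # climb to the first sidelobe peak
        i += 1
    return 20.0 * math.log10(mags[i - 1] / mags[0])


def uniform(x):
    return 1.0


def cosine(x):
    return math.cos(math.pi * x)


print(round(first_sidelobe_db(uniform), 1))
print(round(first_sidelobe_db(cosine), 1))
```

For these textbook tapers the first sidelobe lies near -13.3 dB for uniform illumination and near -23 dB for cosine illumination, which is why one might expect any additional edge taper to lower the near-in sidelobes.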

For transverse polarization, aperture electric fields are normal to 
the sidewalls and are greatly affected by the presence of absorber. 
Though one might initially think that the absorber attenuates and, 
therefore, acts to taper the aperture field, giving rise to reduced near- 
in sidelobes, Fig. 7 indicates this is not the case. This somewhat 
unexpected result could be attributable to aperture phase errors or 
blockage. Aperture phase errors tend to dominantly fill in the first 
Fig. 6—Near-in radiation pattern. Antenna equipped with multiple-edge blinders. 
Longitudinal polarization, transverse measurement plane; 3.95 GHz. 
Fig. 7—Near-in radiation pattern. Antenna equipped with multiple-edge blinders. 
Transverse polarization, transverse measurement plane; 3.95 GHz. 
Fig. 8—Smoothed radiation pattern. Antenna equipped with blinders. Transverse 
polarization, transverse measurement plane; 3.95 GHz. 


radiation null, with secondary nulls only secondarily affected.® Our 
measurements indicate that the first null is invariably preserved, with 
filling in occurring beyond that point. Moreover, phase errors could 
cause beam shifting in the longitudinal (vertical) plane, an effect not 
observed in these measurements. A more likely explanation is the 
excitation of certain higher-order modes, since the absorber was not 
recessed but partially blocked the aperture. (This effect would be 
minimal for longitudinal polarization since the fields are already low 
along the antenna sides.) This hypothesis is difficult to confirm 
without recessing the absorber, an impossibility in this particular 
design and an almost intractable problem from an analytic standpoint. 
It is worth noting though, that even radiation from the higher-order 
modes should be insignificant beyond several beamwidths (e.g., see 
Ref. 10), suggesting that the absorber fundamentally causes increased 
sidelobe levels in the approximate 15- to 40-degree region. This point 
is discussed further in Section III. 


2.1.2 Absorber-lined antenna interior: effect on wide-angle sidelobes 


Figures 4 and 5 show that absorber can significantly affect wide- 
angle sidelobe levels, especially for transverse polarization. This point 
is further illustrated in Figs. 8 through 11, which present smoothed* 
directivity patterns for 3.95-GHz principal and cross-polarized opera- 
tion. Note that when the antenna receives a transversely polarized 


* Smoothed radiation patterns are prepared by the widely accepted technique of 
drawing an envelope across peaks in the antenna radiation pattern. We show a 180- 
degree envelope that represents the worst sidelobe performance over the full 360-degree 
transverse measurement plane. 
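
The smoothing described in this footnote can be sketched as follows: pick the local peaks of the measured pattern, fold negative angles onto positive ones, and keep the highest (worst) level at each angle. The function names and the sample pattern are hypothetical; the sketch returns the envelope points only and leaves the drawing of the envelope between them to the reader.

```python
def local_peaks(angles_deg, levels_db):
    """Return (angle, level) pairs at interior local maxima of the pattern."""
    peaks = []
    for i in range(1, len(levels_db) - 1):
        if levels_db[i] >= levels_db[i - 1] and levels_db[i] >= levels_db[i + 1]:
            peaks.append((angles_deg[i], levels_db[i]))
    return peaks


def folded_envelope(angles_deg, levels_db):
    """Fold peaks at -theta onto +theta and keep the worst level per angle."""
    worst = {}
    for angle, level in local_peaks(angles_deg, levels_db):
        key = abs(angle)
        worst[key] = max(level, worst.get(key, float("-inf")))
    return sorted(worst.items())


# Hypothetical pattern samples in dB (not measured data).
angles = [-40, -30, -20, -10, 0, 10, 20, 30, 40]
levels = [-60.0, -50.0, -62.0, -30.0, 0.0, -28.0, -65.0, -55.0, -70.0]
print(folded_envelope(angles, levels))
```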
Fig. 9—Smoothed radiation pattern. Antenna equipped with blinders. Longitudinal 
polarization, transverse measurement plane; 3.95 GHz. 
Fig. 10—Smoothed radiation pattern. Antenna equipped with blinders. Transverse 
polarization response of longitudinal polarization illumination, transverse measurement 
plane; 3.95 GHz. 


signal (co- or cross-polarized), sidelobes beyond approximately 40 
degrees are generally reduced, sometimes by 10 dB or more. Even for 
longitudinally polarized reception, some improvement can be attained, 
though the differences are not as significant. Radiation patterns were 
also measured for 6- and 11-GHz operation. These measurements 
show the same qualitative effects as discussed above and are therefore 
omitted for brevity. 

Wide-angle sidelobe suppression, like that noted above, is commonly 
attributed to reduced edge diffraction. If that were the case, a narrow 
strip of absorber adjacent to the side diffracting edges would accomplish 
the same function. This issue was explored in the second phase 
of the investigation. 


2.2 Absorber-lined diffracting edges 


In the second phase of our study, the multiple-edge blinders were 
removed, and a 3.5-inch strip of 2-inch-thick absorber was recessed 
and placed adjacent to each of the side diffracting edges, as illustrated 
by Fig. 3. Transverse polarization patterns were measured, with a 
typical result shown in Fig. 12 (near-in sidelobes were unaffected). We 
have noted that beyond approximately 40 degrees, the narrow absorber 
strip reduces sidelobes by 4 dB or more. This performance, while less 
Fig. 11—Smoothed radiation pattern. Antenna equipped with blinders. Longitudinal 
polarization response to transverse polarization illumination, transverse measurement 
plane; 3.95 GHz. 
Fig. 12—Smoothed radiation pattern. Antenna not equipped with blinders. Transverse 
polarization, transverse measurement plane; 3.74 GHz. 
impressive than the more consistent suppression exemplified by Fig. 
8, shows the benefit of using absorber, and more clearly defines the 
diffraction-limited region of this antenna as lying beyond 40 degrees 
in the azimuthal plane. 

After comparing the two absorber placement tests described above, 
we conclude: (1) within approximately 15 degrees of the main beam, 
lining the antenna sidewalls increases sidelobe levels, an effect prob- 
ably due to the excitation of higher-order modes; (2) beyond approxi- 
mately 40 degrees, the radiation pattern is dominantly affected by 
exterior aperture edge diffraction, and absorbent materials can signif- 
icantly reduce that contribution to the far-field pattern; and (3) in the 
intermediate angular region of 15 to 40 degrees, absorber on the 
sidewalls affects the radiation pattern in unexpected ways. This last 
point is further developed in the next section. 


III. ANALYTIC METHODS FOR DESCRIBING RADIATION FROM 
PYRAMIDAL HORN-REFLECTOR ANTENNAS 

The Aperture Field Method (AFM) and Geometrical Theory of 
Diffraction (GTD) are the principal analytic methods used to describe 
radiation from large-aperture, reflector antennas. A number of differ- 
ent studies have shown the AFM to be quite accurate in predicting 
Fig. 13—Measured and theoretical patterns of a pyramidal horn-reflector antenna 
without multiple-edge blinders. Transverse polarization, transverse measurement plane; 
4 GHz (adapted from Ref. 13). 
sidelobe behavior down to -40 dB and within approximately 10 degrees 
of the main beam of a pyramidal horn-reflector antenna,”"!* Theo- 
retical limitations associated with this method for wide-angle radiation 
are well established. 

Wide-angle radiation patterns are predicted using GTD. Transverse- 
plane analyses of the pyramidal horn-reflector by Thomas*’ and Siller® 
considered single diffraction from the exterior aperture edges with 
equivalent magnetic line sources. Their theoretical patterns showed 
good qualitative agreement with measured characteristics. Mentzer 
has used both the AFM and a more complete GTD model to analyze 
radiation from this antenna.¹³ This latter analysis includes radiation 
from both the top and bottom aperture edges, using slope wave 
diffraction, as well as reradiative multiple diffraction. A result from 
Mentzer’s study is presented in Fig. 13 for transverse polarization of 
the aforementioned standard horn-reflector antenna. Envelope dis- 
crepancies beyond 120 degrees, where the pattern is nearly 80 dB 
down, are not important and are probably due to radiation leakage 
during the measurements and/or small inadequacies in the analytic 
model. The pattern disparity between approximately 10 and 45 degrees 
is more prominent, however, and suggests radiating sources heretofore 
unconsidered. 

Mentzer’s analytic study did not include two sources of radiation 
that are intimately affected by absorber on the sidewalls of the visible 
antenna aperture. The first of these sources corresponds to wedge 
diffraction at the interface of the paraboloid and horn sidewalls. 
Radiation from these two wedges would be influenced by Semplak’s 
absorber placement, as well as our own (see Fig. 2, appropriate to the 
first phase of this study). The second of the unconsidered sources 
corresponds to an infinite set of surface currents imaged into the walls 
of the visible aperture from the offset paraboloidal reflector. These 
imaged sources would also be influenced by our first absorber treat- 
ment. Simple ray optics and diffraction theory indicate that both of 
the aforementioned sources would radiate predominantly into the 0- 
to 45-degree region, though their relative contribution to the composite 
radiation pattern is presently unknown. 


IV. CONCLUSIONS 


An experimental investigation into the suppression of wide-angle 
radiation from pyramidal horn-reflector antennas is reported in this 
study. We first show that lining the sidewalls of the antenna dramatically 
affects sidelobe structure and energy, with sidelobe reduction occurring 
over a wide angular region for transverse polarization operation. 
Elevated near-in sidelobes are probably due to some aperture blockage, 
as well as the modification of radiation sources that have 
not previously been considered in theoretical analyses of this antenna. 
Subsequent absorber placement on the antenna’s side diffracting edges 
also leads to wide-angle sidelobe suppression and helps to identify the 
pattern region beyond approximately 40 degrees as predominantly 
diffraction limited. 

These experimental results are related to a recent study by R. A. 
Semplak on selective near-in sidelobe reduction for a pyramidal horn- 
reflector antenna. Taken together, the investigations suggest that 
judicious microwave absorber placement could suppress sidelobe en- 
ergy throughout major (i.e., near-in and far-out) regions of the azi- 
muthal radiation plane. 
BER Degradations Caused by Switching in 
Digital Mobile Radio Systems Using Base Station 
Diversity 


By B. S. GLANCE* 
(Manuscript received August 3, 1983) 


This paper presents a study of the Bit Error Rate (BER) degradation 
resulting from base station switching in digital mobile radio systems using 
base station diversity to combat shadow fading. The degradation is caused by 
the discontinuities of the signal received by the mobile unit when transmission 
is switched from one base station to another. To evaluate this effect, a simple 
statistical model has been devised for the spatial variations of shadow fading. 
It consists of a one-parameter spatial autocorrelation function for the (Gaus- 
sian) decibel value of the fading loss, which can be easily simulated. The single 
parameter is a correlation length, which can be varied to emulate different 
fading conditions in the urban environment. The shape of the autocorrelation 
function can similarly be varied. This model was used to evaluate the BER 
degradation of mobile radio systems using Phase-Shift-Keying (PSK) modu- 
lation. The results show that, in worst-case conditions, the BER is insignifi- 
cantly affected by switching for BER values above 9 x 10~° for two-PSK and 
1.5 X 10~* for four-PSK. Adding a threshold into the switching test can reduce 
or increase the switching degradation, depending on the threshold value. 


I. INTRODUCTION 


Shadow fading in mobile radio propagation is caused by large 
obstacles blocking the transmission path. This effect is combated in 
cellular systems by using base station diversity.’ The arrangement 
consists of several base stations at different locations in the cell, which 


*AT&T Bell Laboratories. 
provide path diversity. Only one base station transmits to the mobile 
receiver. Switching tests are done at regularly spaced intervals, Δx, to 
compare the signals received by the base stations. As the mobile unit 
moves, the transmission is switched from one base station to another 
according to the base station receiving the strongest signal. 

In mobile radio systems using digital modulation, the signal discon- 
tinuities caused by base station switching give rise to transient errors 
in the recovery of the digital information. The purpose of the present 
study is to evaluate the degradation of the Bit Error Rate (BER) due 
to switching in a cellular system using Phase-Shift-Keying (PSK) 
modulation and three-corner base station diversity. 

Section II defines the mobile radio system used for this evaluation. 
A simple shadow model (including spatial variations) is proposed in 
Section III to emulate the shadow fading conditions in an urban 
environment. This model is used in Section IV to determine the 
statistics of the distance traveled by a mobile unit between consecutive 
switches. This is done for different mobile paths covering the surface 
of a cell and for different fading conditions. Section V evaluates the 
number of errors caused by switching, and Section VI evaluates the 
BER degradation caused by switching for two-PSK modulation. These 
calculations are done for different signaling rates, different distances 
between switching tests, and different fading conditions. The same 
evaluation is then replicated in Section VII for four-PSK modulation. 
Finally, Section VIII assesses the effect on the BER of a threshold 
condition in the switching test. 


II. THE MOBILE RADIO SYSTEM 


Consider a cellular, digital mobile radio system having a three- 
frequency reuse plan, and with three base stations per cell located at 
alternate corners (see Fig. 1). Shadow fading is combated by selecting 
the base station receiving the strongest mobile signal. This is achieved 
by periodically comparing the medium-term power of the mobile 
signals received by the three base stations; the power is measured over 
a time period adjusted so as to average out the Rayleigh fading.” 

The power measured by a base station depends on the distance 
between the mobile receiver and the base station, and on the shadow 
attenuation affecting that transmission path. The measurement 
corresponding to the jth path, j = 1, 2, or 3, can be expressed by 

\[ \langle S_j \rangle = S_0 \, \frac{L_j}{r_j^{\gamma}}, \tag{1} \]

where r_j is the distance in miles; S_0 is the median power received at a 
distance of one mile; L_j is a random variable with a log-normal 
distribution, representing the shadow attenuation; and γ is the propagation 
Fig. 1—Three-frequency reuse cellular configuration, with base stations that interfere 
with desired signal at mobile receiver. 


law exponent. In our calculations, we assume that γ is 3.8 and 
that the standard deviation of the Gaussian variable 10 log_10{L} is 8 
dB.?4 
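
A minimal numerical sketch of this power model follows. It assumes that eq. (1) has the form ⟨S_j⟩ = S_0 L_j / r_j^γ, which is how the definitions in the text read; the function names, the unit reference power, and the seeded random draw are illustrative assumptions.

```python
import random

GAMMA = 3.8      # propagation law exponent stated in the text
SIGMA_DB = 8.0   # standard deviation of 10*log10(L) stated in the text


def median_power(distance_miles, shadow_db, s0=1.0, gamma=GAMMA):
    """Medium-term power S0 * L / r**gamma, with the shadow term L given in dB."""
    shadow_linear = 10.0 ** (shadow_db / 10.0)
    return s0 * shadow_linear / distance_miles ** gamma


def draw_shadow_db(rng, sigma_db=SIGMA_DB):
    """One log-normal shadow attenuation sample, expressed in dB."""
    return rng.gauss(0.0, sigma_db)


rng = random.Random(1)                 # hypothetical seed, for repeatability only
print(median_power(2.0, 0.0))          # distance loss alone at two miles
print(median_power(2.0, draw_shadow_db(rng)))
```

With γ = 3.8, doubling the distance lowers the median power by about 11.4 dB before any shadowing is applied.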

The signal modulation is either two-PSK or four-PSK, with differ- 
ential encoding. (Differential encoding is required because the mobile 
receiver cannot retrieve the absolute value of the carrier phase.) The 
signal at the receiver output of the mobile unit consists of several 
maximal-ratio-combined space diversity signals, where we assume 
three branches for two-PSK and four branches for four-PSK. The 
number of combining branches in each case represents the minimum 
number required to obtain a medium-term bit error rate (BER) below 
107° with 90-percent probability. The combining can be done at both 
sides of the transmission path or at the base station only by using 
time-division retransmission;’ the switching-induced degradation 
studied here is independent of which scheme is used. 

The system performance is assumed to be primarily limited by the 
interference originating from transmitters using the same radio chan- 
nel in other cells. Most of the interference comes from the nearest 18 
cells* (see Fig. 1). The medium-term signal-to-interference ratio 
(⟨SIR⟩) is given for the jth base station to mobile unit path by 

\[ \langle \mathrm{SIR}_j \rangle = \frac{L_j}{r_j^{\gamma}} \left[ \sum_{i=1}^{N} \frac{L_i}{r_i^{\gamma}} \right]^{-1}. \tag{2} \]

For the reception at the mobile unit, the quantity r_i represents the 
distance between the ith interfering base station and the mobile unit; 
L_i is the shadow attenuation of the same path; and N is the number 
of interfering base stations, a quantity varying between 0 and 18, 
depending on which base stations are transmitting in the 18 interfering 
cells. The quantity ⟨BER⟩ is a function of ⟨SIR⟩ for an uninterrupted 
base station-mobile radio link. When the base station is switched, 
additional errors caused by the signal discontinuity increase the 
⟨BER⟩. The resulting value of ⟨BER⟩ thus depends on the switching 
frequency. For a given mobile unit path, the switching frequency varies 
according to the position of the mobile unit relative to the three base 
stations and with the values of the shadow attenuation of the three 
corresponding paths. A shadow fading model is proposed in the next 
section for use in calculating this effect. 
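
The signal-to-interference ratio can be sketched under the assumption that eq. (2) is the ratio of the desired path term L_j / r_j^γ to the sum of the same term over the N transmitting interferers; the printed equation is damaged in this copy, so that form is an inference from the surrounding definitions. All names and sample values below are hypothetical.

```python
import math


def link_gain(distance_miles, shadow_db, gamma=3.8):
    """Path term L / r**gamma, with the shadow attenuation L given in dB."""
    return 10.0 ** (shadow_db / 10.0) / distance_miles ** gamma


def medium_term_sir(desired, interferers, gamma=3.8):
    """Ratio of the desired path term to the summed interferer terms.

    desired is a (distance, shadow_db) pair; interferers is a list of such
    pairs, one per transmitting co-channel base station (N may be zero).
    """
    r, shadow = desired
    signal = link_gain(r, shadow, gamma)
    interference = sum(link_gain(ri, li, gamma) for ri, li in interferers)
    if interference == 0.0:
        return math.inf
    return signal / interference


# Hypothetical geometry: desired base at 1 mile, two interferers at 4 and 5 miles.
sir = medium_term_sir((1.0, 0.0), [(4.0, 0.0), (5.0, 3.0)])
print(10.0 * math.log10(sir))
```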


III. THE SHADOW FADING MODEL 


Measurements of the spatial variation of the shadow fading in an 
urban environment have been reported previously.® No statistical 
analysis has been made for results of this kind. It was therefore decided 
to represent shadow fading along a mobile unit path by a simple one- 
dimensional spatial autocorrelation function for the Gaussian random 
variable 10 log_10{L}, which can be easily simulated. The dependence of 
the autocorrelation function on the perpendicular direction to the 
mobile unit path is neglected by assuming a relatively large spacing 
between parallel paths. The autocorrelation function is characterized 
by a single parameter, defined as a correlation length, which can be 
varied to emulate different fading conditions in the urban environ- 
ment. The same effect is also achieved by changing the shape of the 
autocorrelation function. This is done in the present study by using 
two different shapes. The first one, given by 


\[ \rho_1(x) = e^{-x/x_1}, \tag{3} \]


simulates fading conditions having a relatively fast decorrelation be- 
tween fades. The second one, given by 


\[ \rho_2(x) = e^{-x/x_2} (1 + x/x_2), \tag{4} \]


represents fading conditions having a slower decorrelation for distances 
that are small relative to the decorrelation length. The two 
constants, x_1 and x_2, are adjusted so that the decorrelation length is 
the same for both functions, the correlation length being defined as 
the distance x, which makes both correlation functions decrease to 
0.1. 
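
The adjustment of the two constants can be worked through numerically. The sketch below assumes the exponential form e^(-x/x_1) for the first function and e^(-x/x_2)(1 + x/x_2) for the second, and solves for the constants that bring both to 0.1 at a chosen decorrelation length. Function names and the bisection bounds are illustrative choices.

```python
import math


def rho1(x, x1):
    """First autocorrelation shape: simple exponential decay."""
    return math.exp(-x / x1)


def rho2(x, x2):
    """Second autocorrelation shape: flatter near the origin."""
    return math.exp(-x / x2) * (1.0 + x / x2)


def scale_constants(decorrelation_length, target=0.1):
    """Return (x1, x2) so that both shapes equal target at the given length."""
    x1 = decorrelation_length / math.log(1.0 / target)
    # Solve exp(-u) * (1 + u) = target by bisection; the left side falls
    # monotonically from 1 toward 0 as u increases from 0.
    lo, hi = 0.0, 50.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if math.exp(-mid) * (1.0 + mid) > target:
            lo = mid
        else:
            hi = mid
    u = 0.5 * (lo + hi)
    return x1, decorrelation_length / u


x1, x2 = scale_constants(50.0)
print(x1, x2)
print(rho1(5.0, x1), rho2(5.0, x2))
```

For a target of 0.1 the first constant is the decorrelation length divided by ln 10, and the second is the decorrelation length divided by roughly 3.89. At short distances the second function stays closer to unity than the first.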

The autocorrelation function ρ_2(x) represents more realistically the 
spatial variation of shadow fading than does ρ_1(x), which we believe 
to be pessimistic because it decays faster near the origin. The shadow 
fading model is used in the next section to analyze the switching 
statistics, specifically, the statistics of the distance traveled by a mobile 
unit between consecutive base station switches. 
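
The paper does not say how the fading process was generated. One common way to produce a Gaussian sequence with an exponential autocorrelation is a first-order recursion, sketched below as an assumption rather than as the author's method: with samples spaced by a fixed step, a coefficient a = e^(-step/x_1) gives a correlation of a^k between samples k steps apart, which matches the exponential shape. The second shape would need a different generator and is not covered here. All names and parameter values are hypothetical.

```python
import math
import random


def simulate_shadow_db(n_samples, step, x1, sigma_db=8.0, seed=0):
    """Gaussian shadow fading in dB with exponential autocorrelation."""
    rng = random.Random(seed)
    a = math.exp(-step / x1)                      # correlation between neighbors
    innovation = sigma_db * math.sqrt(1.0 - a * a)  # keeps the variance constant
    samples = [rng.gauss(0.0, sigma_db)]
    for _ in range(n_samples - 1):
        samples.append(a * samples[-1] + rng.gauss(0.0, innovation))
    return samples


def lag_correlation(samples, lag=1):
    """Empirical correlation coefficient at the given lag (zero mean assumed)."""
    num = sum(p * q for p, q in zip(samples, samples[lag:]))
    den = sum(p * p for p in samples)
    return num / den


fading = simulate_shadow_db(20000, step=5.0, x1=21.7, seed=1)
print(lag_correlation(fading), math.exp(-5.0 / 21.7))
```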


IV. SWITCHING STATISTICS 


The switching statistics are evaluated using a Monte Carlo computer 
program simulating a mobile receiver moving along a given path 
covering the entire cell. For simplicity, the simulation is assumed to 
be statistically stationary for the shadow fading process, including a 
fixed correlation length over the path. The same calculations are 
repeated for different values of the correlation length and for the two 
autocorrelation functions cited above. By assuming an artificial envi- 
ronment (e.g., statistically stationary, with fixed correlation length), 
but considering different shapes and parameter values, we aim to 
identify the critical features of the shadow fading process. 

The signal received by the mobile unit originates from the base 
station that received the strongest medium-term power ⟨S_j⟩ [see eq. 
(1)] during the past measurement. A switching test is done after each 
mobile unit displacement, Δx, to compare the updated value of ⟨S_j⟩ 
to the similar quantities measured by the two other base stations. 
When ⟨S_j⟩ falls below one of the two other measurements, transmission 
is switched to the base station providing the strongest value. 
Otherwise, transmission is continued from the same base station until 
the next test. 
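
The switching rule just described can be sketched as a short loop over successive tests. The sketch takes one tuple of three medium-term power measurements per test, keeps the serving station unless another station measures a stronger value, and then collects the distances traveled between switches. The function names and the sample measurements are hypothetical, and no threshold is applied.

```python
def serving_stations(measurements):
    """Serving base station index after each switching test.

    measurements holds one tuple of medium-term powers (one value per base
    station) for each test along the mobile unit path.
    """
    first = measurements[0]
    current = max(range(len(first)), key=lambda j: first[j])
    history = [current]
    for powers in measurements[1:]:
        best = max(range(len(powers)), key=lambda j: powers[j])
        if powers[best] > powers[current]:   # serving station fell below another
            current = best
        history.append(current)
    return history


def distances_between_switches(history, step):
    """Distance traveled between consecutive switches, for a test spacing step."""
    distances, run = [], 0
    for previous, current in zip(history, history[1:]):
        run += 1
        if current != previous:
            distances.append(run * step)
            run = 0
    return distances


# Hypothetical powers in dB for four consecutive tests, spaced 10 m apart.
tests = [(-80.0, -85.0, -90.0), (-82.0, -81.0, -90.0),
         (-83.0, -80.0, -88.0), (-79.0, -84.0, -86.0)]
history = serving_stations(tests)
print(history, distances_between_switches(history, 10.0))
```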

The simulation is done for three different mobile unit paths to 
determine the influence of the path on the switching statistics. The 
diagonal of the hexagonal cell is taken to be 3000 m. The first path is 
a meander made of parallel lines spaced every (150√3)/2 meters, 
starting at base station 2 and directed towards the apex located 
between base stations 2 and 3 (see Fig. 2a). The second path is also a 
meander, perpendicular to the first one, starting from the apex located 
between base stations 1 and 2 and directed towards base station 1 (see 
Fig. 2b). The third path consists of concentric circles about the cell 
center, which is the origin of the mobile unit path (see Fig. 2c). 

The cumulative probability distributions of the distance 
traveled by the mobile unit between consecutive switches are evaluated 
for fading conditions having decorrelation lengths of 50, 100, and 
200 m. This is done in each case for various values of the displacement, 
Δx. Two significant results emerge. First, the distributions corresponding 
to different decorrelation lengths can be fitted on the same curves 
when the distance, x, and the displacement, Δx, are both normalized 
by the decorrelation length, xz. Second, the results are almost inde- 
pendent of the path followed by the mobile unit. Both these findings 
(a) (b) (c) 


Fig. 2—Three different mobile unit paths used for calculations of switching statistics. 
Fig. 3—Cumulative probability distribution of distance traveled by mobile unit between 
consecutive base station switches, calculated using autocorrelation function ρ_1. 


are confirmed by the thickness of the curves in Fig. 3, which represents 
the spread due to superimposing results for the three paths and the 
three decorrelation lengths. Note that the distributions are, however, 
dependent on the parameter Δx. These curves are all for the 
autocorrelation function ρ_1 [see (3)]. 

Similar results are obtained for the autocorrelation function ρ2 [see
(4)], as shown in Fig. 4. In this case, the spread due to the different
paths is slightly larger, but the dependence on the parameter Δx is
smaller and becomes almost nil for normalized values equal to or below
0.2. This effect is shown in Fig. 5, which compares the distribution
Fig. 4—Same results as in Fig. 3, for the autocorrelation function ρ2.

Fig. 5—Comparison between cumulative probabilities obtained with ρ1 and ρ2 for
the same normalized mobile unit displacement values of 0.1, 0.2, 0.4, and 0.8.

obtained for the two autocorrelation functions, calculated for a decorrelation
length of 50 m and displacements of 5, 10, 20, and 40 m. As
expected, the divergence between the distributions is significant for
Δx values below 10 m, but becomes almost nil when Δx is increased to
20 m, a value above which both autocorrelation functions are almost
identical. Similar results are observed for the distributions calculated
for the longer decorrelation lengths.

Another statistical result of interest is the average distance (x)
traveled by the mobile unit between switches versus the mobile unit
displacement, Δx. This quantity is averaged over all the distances
given by the distribution for a given path, correlation function, and
decorrelation length. The results corresponding to the autocorrelation
functions ρ1 and ρ2 are shown, respectively, in Figs. 6 and 7, where


[Fig. 6 plot. Autocorrelation function ρ1. Axes: normalized average distance, ⟨x⟩/x_d, and switching ratio in percent, versus normalized displacement, Δx/x_d.]


Fig. 6—Average distance and switching frequency versus mobile unit displacement
calculated with ρ1.
[Fig. 7 plot. Autocorrelation function ρ2. Axis: normalized average distance, ⟨x⟩/x_d.]

Fig. 7—Same results as for Fig. 6, calculated with ρ2.


(x) and Δx are normalized by x_d. The spread shown by these curves
is again due to superimposing the results for the different paths and
decorrelation lengths. The other curves shown in these figures represent
the switching frequency, measured as the ratio of the number of
switches to the total number of tests along the path. Note that the
frequency increases to about one third when the displacement is above
one decorrelation length. This maximum frequency is equal to that
given by random switching between the three base stations. Note also
that the switching frequency is substantially reduced only when
Δx/x_d < 0.1, a condition fulfilled for fading distributions having long
decorrelation lengths.
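
The dependence of the switching frequency on the normalized displacement can be illustrated with a small Monte Carlo sketch. It is not the simulation used in this study. It assumes an exponential correlation exp(-Δx/x_d) between consecutive switching tests, three base stations with equal mean power and independent fading, and a hypothetical fading spread of 8 dB; the function name and all parameter names are illustrative. Because of the equal-mean assumption, the absolute switching ratios are not comparable to those of Figs. 6 and 7; only the dependence on Δx/x_d is of interest.

```python
import numpy as np

def switching_ratio(x_d, dx, n_tests=20_000, sigma_db=8.0, seed=0):
    """Fraction of switching tests that change the strongest base station.

    Assumptions (not from the paper): exponential autocorrelation,
    equal mean power for the three stations, independent fading per station.
    """
    rng = np.random.default_rng(seed)
    r = np.exp(-dx / x_d)                       # correlation between consecutive tests
    innovations = sigma_db * rng.standard_normal((n_tests, 3))
    fading = np.empty((n_tests, 3))             # shadow fading in dB, one column per station
    fading[0] = innovations[0]
    for k in range(1, n_tests):                 # first-order autoregressive recursion
        fading[k] = r * fading[k - 1] + np.sqrt(1.0 - r * r) * innovations[k]
    serving = np.argmax(fading, axis=1)         # strongest station at each test
    return float(np.mean(serving[1:] != serving[:-1]))

# Same normalized displacement (0.2) for two different decorrelation lengths.
print(switching_ratio(50.0, 10.0), switching_ratio(200.0, 40.0))
# Smaller and larger normalized displacements.
print(switching_ratio(50.0, 5.0), switching_ratio(50.0, 50.0))
```

In this sketch the ratio depends on Δx and x_d only through their quotient, so the first two values coincide, and the ratio grows with the normalized displacement.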


V. ERRORS CAUSED BY SWITCHING 


Switching between base stations causes phase and amplitude discontinuities
in the signal received by the mobile unit. These effects
generate errors in the demodulation process, in addition to those 
produced by the interference corrupting the desired signal. The num- 
ber of switching errors depends on the type of demodulator used for 
recovering the data. Differential detection, which is only sensitive to 
phase discontinuities, gives rise, on average, to one-half error per 
switch because the phase discontinuities are random and uniformly 
distributed. The switching errors generated by a coherent demodulator 
lead to a more complex calculation, which depends on the phase and 
on the amplitude discontinuities. 
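
The one-half figure for differential detection can be checked numerically. The sketch below assumes noise-free two-PSK, a phase jump uniformly distributed over 2π, and a detector that decides on the sign of the cosine of the phase difference between consecutive symbols; the variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
n_switches = 200_000

# Phase discontinuity introduced by each switch, uniform over (-pi, pi).
phase_jump = rng.uniform(-np.pi, np.pi, n_switches)

# Only the symbol pair that straddles the switch is affected. Without noise,
# the differential detector errs when the jump moves the phase difference
# into the wrong half-plane, i.e. when cos(jump) < 0.
errors_per_switch = np.mean(np.cos(phase_jump) < 0)
print(f"average errors per switch: {errors_per_switch:.3f}")
```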

The amplitude discontinuity is equal to the difference between the 
instantaneous signal levels received before and after the switch. In the 
radio mobile environment, these quantities vary randomly over very 
short distances. Note that the amplitude of the preswitch signal can 
be larger than that of the postswitch signal, even when the mean value 
of the latter signal is larger than that of the former. The two instan- 
taneous amplitudes are described by independent random variables 
having a chi-square distribution of order three for two-PSK signals 
and order four for four-PSK signals, according to the diversity design 
defined in Section II. 

The number of switching errors is evaluated for a demodulator using 
“modulation wipeoff” carrier recovery, according to a recently pub- 
lished technique.’ This is done by assuming that the preswitch and 
postswitch signals have the same medium-term power and the same 
medium-term SIR. This approximation is valid when the interval 
between switching tests is relatively small compared to a decorrelation 
length. Under these assumptions, one finds that the average number
of errors varies between one and two per switch for two-PSK differentially
encoded signals, when the (SIR) increases from 0 dB to
infinity. The same quantity for four-PSK differentially encoded signals
varies between 1 and 2.6 errors per switch for the same range of
(SIR) values (see Fig. 8).

Differential detection thus provides better protection against
switching errors. For two-PSK, it also gives nearly the same performance
as coherent detection in the absence of switching. For example,
the values of (BER) given by differential and coherent detection
are, respectively, 4.8 × 10⁻⁴ and 3.1 × 10⁻⁴ for (SIR) = 9.6 dB in the
case of a three-branch combiner. (The value 9.6 dB corresponds to the
90-percentile (SIR) given by the cellular configuration described in
Section II.) This result comes from relations®


$$(\mathrm{BER})_{\mathrm{diff}} = \frac{1}{2}\,\frac{1}{[1 + (\mathrm{SIR})]^{3}} \qquad (5)$$


and 
Fig. 8—Average number of switching errors versus (SIR), calculated for two-PSK
and four-PSK coherently demodulated signals.

For four-PSK, on the other hand, differential detection is considerably
less efficient than coherent detection. In this case, the (BER)
expressions (5) and (6), applied to a four-branch combiner, become,
respectively,®


1 


(BER) spit a ee (7) 
The (BER) calculated for the same (SIR) value is within the required
value of 10⁻³ for coherent detection, but it increases to 5.5 × 10⁻³ for
differential detection. In this case, the number of combiner branches
would have to be increased by one to fulfill the (BER) requirement, a
solution which may not be acceptable.

Consequently, the switching-induced degradation of (BER) will be 
computed assuming coherent demodulation for four-PSK signals, and 
differential detection for two-PSK signals. 
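
As a numerical check of the differential-detection value quoted in this section for two-PSK, the following sketch evaluates one half of [1 + (SIR)] raised to minus the number of combiner branches, which is the form of relation (5) assumed here for a three-branch combiner. The function name is illustrative, and the generalization to other branch counts through the `branches` argument is an assumption; it is not meant to represent the four-PSK expressions.

```python
def ber_differential(sir_db, branches=3):
    """Average BER of differentially detected two-PSK (assumed form of (5))."""
    sir = 10.0 ** (sir_db / 10.0)          # convert dB to a power ratio
    return 0.5 / (1.0 + sir) ** branches

# 90-percentile (SIR) of the cellular configuration, three-branch combiner.
print(f"{ber_differential(9.6):.2e}")
```

The result is about 4.8 × 10⁻⁴, in agreement with the value quoted for differential detection.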


VI. PERFORMANCE DEGRADATION DUE TO SWITCHING FOR TWO-PSK 
6.1 Calculation of the (BER) 


The degradation is evaluated by comparing the cumulative distri- 
butions of (BER) obtained with and without switching errors. The 
distributions are calculated using Monte Carlo simulations, which 
assume a mobile unit moving along the path shown in Fig. 2a subjected 
to cochannel interference and shadow fading. (The results are essen- 
tially path independent, as noted in Section IV.) 

The (BER) component due to the interference is calculated every
10 m along the mobile unit path according to (5). Switching tests are
simulated at regularly spaced intervals, the spacing being a parameter
varying from 10 m to several times this value. When a switch occurs,
the (BER) is incremented by


$$\Delta(\mathrm{BER}) = \frac{N_e\, v}{S_r\, \delta x} \qquad (9)$$


where N_e represents the number of switching errors, one-half in the
present case; S_r is the signaling rate; v is the mobile unit speed; and
δx is the distance between (SIR) measurements. The calculations are
made for the maximum mobile unit speed of 55 mph to obtain
conservative estimates of degradation.
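
The increment (9) is easy to evaluate numerically. The sketch below assumes the form N_e v / (S_r δx), with the speed converted to meters per second and the signaling rate expressed in bits per second; the function and argument names are illustrative.

```python
MPH_TO_MPS = 0.44704  # exact conversion factor, miles per hour to meters per second

def delta_ber(n_errors, speed_mps, rate_bps, interval_m):
    """BER increment caused by one switch, averaged over the bits sent in one interval."""
    return n_errors * speed_mps / (rate_bps * interval_m)

speed = 55.0 * MPH_TO_MPS                 # maximum mobile unit speed
for rate in (16e3, 32e3, 64e3):           # signaling rates in b/s
    # one-half error per switch (differential two-PSK), 10-m measurement spacing
    print(f"{rate / 1e3:.0f} kb/s: {delta_ber(0.5, speed, rate, 10.0):.2e}")
```

For one-half error per switch and a 10-m spacing, this gives about 7.7 × 10⁻⁵, 3.8 × 10⁻⁵, and 1.9 × 10⁻⁵ for 16, 32, and 64 kb/s.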

The results are first presented for the autocorrelation function ρ2,
which more realistically simulates shadow fading, and for a pessimistic
correlation length of 50 m, the approximate length of a city block. The
results are given for different signaling rates and several values of Δx.
The results are then compared to those obtained for longer decorrelation
lengths, simulating less severe fading conditions. Finally, the
results are also compared to those obtained with the autocorrelation
function ρ1.


6.2 Effect of signaling rate 


The cumulative probability distribution was calculated for signaling
rates of 16, 32, and 64 kb/s, using the autocorrelation function ρ2, a
correlation length of 50 m, and a switching interval of 10 m. Figure 9
shows the results and compares them to the ideal case, i.e., a receiver
without switching errors. The results show very little switching degradation
for (BER) values above 10⁻⁴. Below this value there is a
significant degradation that shifts to lower (BER) values when the
signaling rate increases. For the above three signaling rates, the rapid
change of the distribution begins, respectively, at the (BER) values of
6.8 × 10⁻⁵, 3.8 × 10⁻⁵, and 1.9 × 10⁻⁵. These (BER) values are almost
equal to those given by (9) for the same signaling rates: 7.65 × 10⁻⁵,
3.82 × 10⁻⁵, and 1.9 × 10⁻⁵. At these points, the switching errors
increase the cumulative probability by, respectively, 76, 58, and 45
percent.


Fig. 9—Cumulative probability of (BER) for two-PSK signals, with signaling rate as
a parameter.


6.3 Effect of switching interval 


Figure 10 shows the cumulative probabilities calculated for the same
autocorrelation function, ρ2, and the same correlation length, 50 m,
but for different switching intervals, Δx, varying from 10 to 40 m. In
this case, the signaling rate is 16 kb/s. The results show that the
amplitude of the “jump” occurring at the (BER) value given by (9)
decreases as Δx increases. This effect is explained by noting that the
frequency of occurrence of the switching tests decreases in inverse proportion
to Δx and the resulting degradation decreases accordingly. On
the other hand, the use of a large Δx degrades the distribution for the
large (BER) values because the switching tests are likely to miss the
instant at which (S_i) [see eq. (1)] falls below that received by one of
the two other base stations. This effect increases with the switching
interval. For example, the 90-percentile value of the cumulative probability
shifts from a (BER) value of 5 × 10⁻⁴ to the (BER) values of
8 × 10⁻⁴, 1.6 × 10⁻³, and 2.5 × 10⁻³ when the switching interval
increases from 10 m to, respectively, 20, 30, and 40 m.


Fig. 10—Cumulative probability of (BER) for two-PSK signals, with switching
interval as a parameter.


6.4 Effect of correlation length 


Figure 11 illustrates the effect of the correlation length on the
cumulative probability distribution of (BER). The results are calculated
for the autocorrelation function ρ2, a signaling rate of 16 kb/s,
and the correlation lengths of 50, 100, and 200 m. The same normalized
switching intervals, 0.2 and 0.6, are used for the three correlation
lengths. As we found previously for the switching statistics, the (BER)
statistics corresponding to these three correlation lengths fit nearly
on the same curve for the same normalized switching interval, except
for the section of the distribution on the left of the discontinuity; the
amplitude of the discontinuity decreases when the correlation length
increases. The explanation of this effect is the same as that given in
the previous section.


6.5 Effect of autocorrelation function 


The cumulative probabilities obtained with the autocorrelation
functions ρ1 and ρ2 are compared in Fig. 12. The results correspond to
a correlation length of 50 m for both functions, a signaling rate of 16
kb/s, and the three switching intervals of 10, 20, and 30 m. The two


558 TECHNICAL JOURNAL, APRIL 1984 


autocorrelation functions give nearly the same distribution for the 
lowest switching interval value. Above this value, the distributions 
corresponding to each autocorrelation function diverge increasingly 
with the length of the switching interval. For example, the 90-percen- 
tile point of the cumulative probability shifts from a (BER) value of 
Fig. 11—Cumulative probability of (BER) for two-PSK signals, with correlation
length as a parameter.

Fig. 12—Cumulative probability of (BER) for two-PSK signals, with autocorrelation
shape function as a parameter.

8 × 10⁻⁴ to 1.8 × 10⁻³ for a switching interval of 20 m, and from 4.5
× 10⁻³ to 6 × 10⁻³ for a switching interval of 30 m.


VII. PERFORMANCE DEGRADATION DUE TO SWITCHING FOR FOUR-PSK


The switching degradation for four-PSK is evaluated for coherently 
detected signals, using the same simulation as for two-PSK. In this 
case, however, the number of switching errors is given by the relation 
shown in Fig. 8, and the (BER) resulting from the interference is 
calculated according to (9). The results are presented as functions of 
the same parameters and for the same conditions as for two-PSK. 


7.1 Effect of signaling rate 


Cumulative probability distributions were calculated for signaling
rates of 16, 32, and 64 kb/s, using the autocorrelation function ρ2, a
correlation length of 50 m, and a switching interval of 10 m. The results
are shown in Fig. 13 and are compared to the ideal case without
switching errors. The discontinuities of the distribution are now
shifted to higher (BER) values, as expected from (9), since the number
of switching errors is larger for four-PSK. They occur for the above
signaling rates at, respectively, 1.3 × 10⁻⁴, 7 × 10⁻⁵, and 4 × 10⁻⁵.
These are about twice the values given for two-PSK.


7.2 Effect of switching interval


The effect of the switching interval is illustrated in Fig. 14, which
gives the cumulative probabilities calculated for intervals varying from
10 to 40 m. They are obtained using the autocorrelation function ρ2, a
decorrelation length of 50 m, and a signaling rate of 16 kb/s. The effect
of the switching interval is essentially the same as that found for two-PSK.
The differences are that the discontinuities are shifted to higher
(BER) values, as previously discussed, and the cumulative probability
increases at this point by 100 percent (instead of 76 percent, as for
two-PSK).


Fig. 13—Same results as in Fig. 9, for four-PSK signals.

Fig. 14—Same results as in Fig. 10, for four-PSK signals.


7.3 Effect of autocorrelation function 


The effect of the autocorrelation function is illustrated in Fig. 15, 
which shows the cumulative probability for both autocorrelation func- 
tions calculated for a correlation length of 50m, a signaling rate of 16 
kb/s, and the switching intervals of 10, 20, and 30m. 

As in the two-PSK case, the two autocorrelation functions give 
nearly the same distribution for a switching interval of 10m. Above 
this value, the results are similar to those given by two-PSK, except 
for the positions of the discontinuities. Also, as before, the effect of 
the correlation length is small. 


VIII. EFFECT OF A SWITCHING THRESHOLD


We conclude the study of the (BER) degradation by evaluating the
effect of adding a threshold condition in the switching tests. The
threshold consists of a reference to which the medium-term power
(S_i) received by the transmitting base station is compared; switching


BER DEGRADATION IN MOBILE RADIO 561 


[Fig. 15 plot. Horizontal axis: ⟨BER⟩, parameter X. Legend: each curve number (1, 2, 3) corresponds to one switching interval in meters; curves 1a, 2a, and 3a use autocorrelation function ρ2, and curves 1b, 2b, and 3b use ρ1.]


Fig. 15—Same results as in Fig. 12, for four-PSK signals. 


is done only if (S_i) is smaller than the reference. This condition is
added to reduce the switching frequency and thus to decrease the
resulting (BER) degradation.

The threshold is adjusted relative to the median value of (S_i) (in
dB) received by the base stations when the mobile unit is at the center
of the cell. This quantity is the same for the three base stations.

The simulation was done for four-PSK, the autocorrelation function
ρ2, a decorrelation length of 50 m, a switching interval of 10 m, and a
signaling rate of 16 kb/s. The results are shown in Fig. 16 for the
threshold values of 0, -4, and -8 dB, and are compared to the
cumulative probability obtained without a threshold for the same
parameter values. The comparison shows a significant reduction of
the discontinuity caused by switching, but at the cost of degrading the
system performance for the (BER) values above Δ(BER). The best
compromise is obtained for a threshold of 0 dB.
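
The threshold condition can be written as a short decision rule. The sketch below is only an illustration of the rule as described in this section: powers are assumed to be medium-term values in dB relative to the median received at the cell center, and the function name, argument names, and tie-breaking behavior are hypothetical.

```python
def select_station(powers_db, serving, threshold_db=None):
    """Return the index of the base station to use after one switching test.

    powers_db    -- medium-term powers received by the base stations (dB)
    serving      -- index of the station currently transmitting
    threshold_db -- reference level; None disables the threshold condition
    """
    best = max(range(len(powers_db)), key=lambda i: powers_db[i])
    # With a threshold, the serving station is kept as long as its own
    # power has not dropped below the reference.
    if threshold_db is not None and powers_db[serving] >= threshold_db:
        return serving
    return best

print(select_station([1.0, 2.0, -10.0], serving=0))                     # switches to station 1
print(select_station([1.0, 2.0, -10.0], serving=0, threshold_db=0.0))   # keeps station 0
print(select_station([-3.0, 2.0, -10.0], serving=0, threshold_db=0.0))  # switches to station 1
```

Lowering the threshold suppresses more switches, which is the trade-off examined in Fig. 16: fewer switching errors, but a longer stay on a weaker station.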


IX. CONCLUSION 


A simple statistical model has been devised for the spatial variations 
of shadow fading, consisting of a one-parameter spatial autocorrelation 
function for the log-normal fading loss variable. The shape of the 
function and the value of the parameter can be varied to emulate 
different fading conditions in the urban environment. This model was 
used to simulate shadow fading for a mobile unit moving along a given 
path in a radio mobile cellular configuration using three-corner base 
station diversity. 

Two significant geometric results emerge from this simulation study.
First, the cumulative probability curves for the distance traveled by a
mobile unit between consecutive switches, obtained for different fading
conditions, can be fitted on the same curves when the distance between
switches and the length of the interval between switching tests are
both normalized by the correlation length. Second, the above distribution,
the average distance between switches, and the switching
frequency are all nearly independent of the path followed by the
mobile unit.

The above results were used to evaluate the (BER) degradations
due to switching, for both two-PSK and four-PSK modulations. We
found that, in the worst condition (a mobile unit moving at 55 mph,
a decorrelation length equal to a city block, switching tests done every
10 m, and a low signaling rate of 16 kb/s), the distributions are
insignificantly affected by switching for (BER) values above 9 × 10⁻⁵
for two-PSK, and above 1.5 × 10⁻⁴ for four-PSK. These (BER) values
are found to be about equal to the (BER) increment caused by
switching for the respective modulation. The cumulative probability
roughly doubles below these values. The amplitude of this effect is
reduced by increasing the length of the switching test interval, but the
improvement is more than canceled by an overall degradation of the
cumulative probability distribution.

It was found, as expected, that the switching degradation is reduced
when the correlation length increases. The same effect is obtained by
using a higher signaling rate or for a mobile unit moving at lower
speed (if the switching tests are done at the same intervals).

Fig. 16—Cumulative probability of (BER) for four-PSK signals, calculated with a
threshold switching condition.

The addition of a threshold condition in the switching test reduces 
the amplitude of the discontinuity of the cumulative probability func- 
tions. However, this is achieved at the cost of degrading the distribu- 
tion for large (BER). An acceptable compromise between these two 
effects is obtained for a threshold of 0 dB. 

Finally, the (BER) statistics are found to be nearly independent of 
the shape of the autocorrelation function if the switching tests are 
done at intervals smaller than or equal to 0.2 times the correlation 
length. Above this value, the results obtained for different shapes 
diverge increasingly with the length of the interval. 


Processing Channel-Bank Spectrometer Data


A channel-bank spectrometer produces weighted samples of its input spectrum,
the weighting function being determined by the shape of the channel
filters. We desire to compare two different input spectra, with similar shape 
but different widths, from their corresponding weighted samples. This problem 
arises in millimeter-wave astronomy. For example, a given astronomical source 
may contain two isotopes of the same molecule. If we know their rest frequen- 
cies, comparison of their observed spectra, with appropriate frequency shifts 
and scale changes, can determine whether the two isotopes have the same 
velocity distribution. Given the weighted samples for a particular input spec- 
trum, we find the optimum way to calculate the samples for a narrower 
spectrum having the same shape. We determine the noise and distortion both 
for this optimum method and for an approximation to it. These results are 
compared for a particular example with those obtained from an ad-hoc 
algorithm called “Squish”, currently used for this purpose; in this case Squish 
is almost as good as the approximation to the optimum method. The present 
results permit such comparisons in other cases that may be of interest, where 
the present method may offer significant improvement. 


I. INTRODUCTION 


Consider two well-separated spectral lines observed in a particular 
radio astronomical source. The intrinsic line widths are usually much 
smaller than the Doppler spread. If the lines correspond to two isotopes 
of the same molecule and have the same velocity distribution, these 
two lines will have the same normalized shape, the higher-frequency 
line being wider. Let their spectral densities be P,,(f) and P,,(f), 
centered on frequencies f_1 and f_2, respectively, with f_2 > f_1. Since
Doppler frequency shift Af and relative velocity v are related by 
Af/f = v/c, where c is the velocity of light, the line widths are 
proportional to the ratio of center frequencies 


$$p = \frac{f_2}{f_1} \qquad (1)$$


and the line shapes are related by 
P.( pf) = C-Ps,(f — fo + fi) (2) 


if the two isotopes have the same velocity distribution, where the 
“shrink” parameter p is given by (1) and C is a line strength parameter. 
We call P,,(pf) the “shrunk” version of P,,(f). 

The spectral density is not observed directly, but rather measured 
with a channel-bank spectrometer. This consists of a fixed bank of 
equally spaced filters of identical shape, each with its own detector. 
The input spectrum is first translated in frequency so that it lies at 
the center of the band covered by the filter bank. The detector outputs 
represent weighted, noisy samples of the input spectral density. 

: Let the translated spectra of the two isotopes be denoted P,,(f) and 
P..(f); their centers now coincide. Then if 


Pf) = C-P..(f) (3) 


the two isotopes have the same velocity distribution. We must deter- 
mine whether this is so by examining the spectral samples, correspond- 
ing to P.(f) and Pf ), produced by the spectrometer. To do so, we 
must compute the spectral samples of the shrunk spectrum P.,( of ) in 
terms of the measured spectral samples of the actual input spectrum 
P,,(f). 

We therefore consider the following problem. A spectrometer with
input spectral density P,(f) produces weighted spectral samples x_k,
with sample noises n_k. Expressions for x_k and for the statistics of the
n_k are given in the next section, and derived in the appendix. Denote
the noiseless samples of the shrunk spectrum P,(pf) by x_{pk}. Then we
wish to estimate the set {x_{pk}} from the set {x_k + n_k}, and determine
the mean-square estimation errors due to noise and distortion. This is
accomplished by standard linear estimation techniques.


II. CHANNEL-BANK SPECTROMETER ANALYSIS


A typical section of a channel-bank spectrometer is illustrated in
Fig. 5 in the appendix. The input signal plus noise are filtered, square-law
rectified, and integrated to produce the output spectral sample.
Equation (67) gives the spectral sample in terms of the input spectral
density and the common channel filter shape. For convenience, we
consider an analogous time-domain model, in which approximate
samples of a pulse x(t) are obtained as follows:


$$x_k = \int_{-\infty}^{\infty} x(t)\, a(t - kT)\, dt, \qquad -\infty < k < \infty. \qquad (4)$$


The spectrum being measured is represented as x(t), and hence is real,
positive, and symmetric; the weight function a(t) represents the square
magnitude of the baseband equivalent channel-bank filter transfer
function (64), and so it is real and positive, but not necessarily
symmetric:


$$x(t) = x^{*}(t) = x(-t) \ge 0; \qquad a(t) = a^{*}(t) \ge 0. \qquad (5)$$


Denote the Fourier transforms of x(t) and a(t) by X(f) and A(f),
respectively. If x(t) is passed through a filter with transfer function
A(-f), as in Fig. 1, the x_k are sample values of the filter output. X(f)
is real and symmetric (but not necessarily positive), while A(f) is
Hermitian. The weight function is normalized to unity at t = 0, its
nominal center:


$$a(0) = \int_{-\infty}^{\infty} A(f)\, df = 1. \qquad (6)$$


This corresponds to unit gain at midband of the channel-bank filters.
A constant input will yield constant samples scaled by A(0), the dc
gain of the weighting filter:


$$x(t) = C, \qquad x_k = C \int_{-\infty}^{\infty} a(t)\, dt = C \cdot A(0). \qquad (7)$$


The second-order sample noise statistics for a simultaneous channel-bank
spectrometer are given in (76) and (86) when the integration
time τ for the integrator in Fig. 5 is large compared to the reciprocal
channel-bank filter bandwidth (75). Thus, sample noises n_k are added
to each of the samples x_k of (4); the resulting noisy samples are


$$x_k + n_k, \qquad -\infty < k < \infty. \qquad (8)$$


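[Fig. 1 block diagram. The input x(t) passes through the weighting filter, noise n(t) is added, the sum is sampled by T Σ δ(t - kT), with k running from -∞ to ∞, and the reconstruction filter delivers the output y(t) = x_r(t) + n_r(t).]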


Fig. 1—Model of channel-bank spectrometer. 
The n_k have the following statistics:


$$\langle n_k^2 \rangle = N \qquad (9)$$

and

$$\langle n_k n_{k+m} \rangle = N \phi_m, \qquad \phi_m = \frac{\int_{-\infty}^{\infty} a(t)\, a(t + mT)\, dt}{\int_{-\infty}^{\infty} a^{2}(t)\, dt}. \qquad (10)$$


The noise power N (77) is directly proportional to receiver noise 
temperature squared and inversely proportional to integration time. If 
different channel-bank filters overlap very little, i.e., if the weight 
functions a(t — kT) for different k in (4) have little overlap, then 
different noise samples will have small correlation (10). 

Finally, we recover an output y(t) by a linear stationary operation
on the infinite set of noisy samples (8),


$$y(t) = T \sum_{k=-\infty}^{\infty} (x_k + n_k)\, h(t - kT). \qquad (11)$$

The weight function h(t) chosen for reconstruction depends on what
we desire as output.

The above set of operations may be represented by the block diagram
of Fig. 1. The pulse x(t) is passed through a filter with transfer function
A(-f), where A(f) is the Fourier transform of a(t) of (4):


$$A(f) = \int_{-\infty}^{\infty} a(t)\, e^{-j2\pi f t}\, dt. \qquad (12)$$

Bandlimited Gaussian noise n(t) is added, with spectral density

$$P_n(f) = NT \sum_{m=-\infty}^{\infty} \phi_m\, e^{-j2\pi m T f}, \quad |f| < \frac{1}{2T}; \qquad P_n(f) = 0, \quad |f| > \frac{1}{2T}. \qquad (13)$$

Here φ_m is the normalized discrete covariance of the noise samples,
given in terms of the channel-bank filter characteristic a of (4) by (9)
and (10). If the different filters overlap very little, the noise spectrum
of (13) will be almost white. The noisy filtered output is sampled at
interval T; finally, a reconstruction filter with transfer function


$$H(f) = \int_{-\infty}^{\infty} h(t)\, e^{-j2\pi f t}\, dt \qquad (14)$$

yields an output y(t) composed of a signal component x_r(t) and noise
component n_r(t) as shown. From (11) we get:


$$x_r(t) = T \sum_{k=-\infty}^{\infty} x_k\, h(t - kT), \qquad n_r(t) = T \sum_{k=-\infty}^{\infty} n_k\, h(t - kT). \qquad (15)$$

Returning to the problem in the introduction, from the noisy samples
(8) we wish to estimate


$$x_{pk} = \int_{-\infty}^{\infty} x(pt)\, a(t - kT)\, dt, \qquad -\infty < k < \infty, \qquad (16)$$


with

$$p > 1, \qquad (17)$$


i.e., the spectrometer response to the shrunk spectrum. In the noiseless
case, if we require that this be done without error, then we must be
able to perfectly reconstruct x(t) from x_k of (4); this requires that x(t)
be bandlimited:


$$X(f) = 0, \quad |f| > W; \qquad WT \le 0.5. \qquad (18)$$


There is no aliasing as long as the bandwidth W of x(t) is less than
1/(2T), where T is the sampling interval. Since only noise exists
outside this band, |f| > W, we take the reconstruction filter transfer
function to be zero there, whatever its purpose:


$$H(f) = 0, \quad |f| > W. \qquad (19)$$

The output noise n_r(t) of Fig. 1 is stationary, as we see by (13) and
(19).

We now determine H(f), |f| < W, in order to recover the “shrunk”
samples (16) from the noiseless samples (4).


III. DISTORTIONLESS ESTIMATE OF SHRUNK SPECTRUM


Assume that x(t) is bandlimited (18) and that the added noise (9) is
zero, N = 0. We wish to choose h(t) in (11) such that the samples
x_r(pkT) of (15) and x_{pk} of (16) are equal:


$$x_r(pkT) = x_{pk}. \qquad (20)$$

From (18), (19), and Fig. 1 we see that

$$X_r(f) = A(-f)\, H(f)\, X(f). \qquad (21)$$

For (20) we require

$$X_r\!\left(\frac{f}{p}\right) = A(-f)\, X\!\left(\frac{f}{p}\right). \qquad (22)$$

Comparing (21) and (22) we get


$$H(f) = \begin{cases} \dfrac{A(-pf)}{A(-f)}, & |f| < W, \\[2ex] 0, & |f| > W; \end{cases} \qquad 0 < WT \le 0.5. \qquad (23)$$


Equation (23) gives the transfer function for the ideal reconstruction
filter for the bandlimited case (18), in the absence of noise. The bandlimited
case in our time-domain model corresponds in the appendix to
an input spectral density P,(f) that varies sufficiently slowly compared
to the channel-bank filter spacing, F. We will use the filter (23)
throughout, and calculate the resulting errors due to noise, and to
linear distortion and aliasing when (18) is not satisfied.

The noise n_r(t) in Fig. 1 is stationary; consequently, the noise
associated with the samples (20) is


$$\langle n_r^2 \rangle = \int_{-W}^{W} P_n(f) \left| \frac{A(pf)}{A(f)} \right|^{2} df, \qquad WT \le 0.5, \qquad (24)$$


where P_n(f) is given by (13). If the noise is almost white, we may set
P_n(f) = NT.

The reconstruction filter characteristic (23) depends on the shrink 
factor, p. If for a given p we desire a fixed filter for all spectra that can 
be handled by the spectrometer, we take WT = 0.5. However, if we 
know that a given spectrum varies more slowly, we can improve the 
signal-to-noise ratio by taking a smaller value for W. The minimum 
value for W, for bandlimited x(t) (18), is determined by our present 
requirement that the aliasing be zero. 
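
The distortionless estimate can be tried out numerically. The sketch below is an illustration under stated assumptions rather than part of the analysis: it uses a single-pole (Lorentzian) weight function a(t) = 1/[1 + (t/t_0)²] with T = 2t_0, for which the reconstruction filter H(f) = A(-pf)/A(-f) reduces to exp[-πT(p - 1)|f|] on |f| ≤ 1/(2T) and has a closed-form impulse response; a Gaussian test pulse x(t) that is bandlimited only to within numerical precision; and finite sums and integration ranges. The values of p and σ and all function names are hypothetical.

```python
import numpy as np

T, t0, p, sigma = 1.0, 0.5, 1.2, 2.0    # T = 2*t0; p and sigma are arbitrary test values

def a(t):
    """Single-pole (Lorentzian) weight function, unity at t = 0."""
    return 1.0 / (1.0 + (t / t0) ** 2)

def x(t):
    """Hypothetical test pulse; its transform is negligible beyond 1/(2T)."""
    return np.exp(-t**2 / (2.0 * sigma**2))

def h(t):
    """Inverse transform of exp(-pi*T*(p-1)*|f|) restricted to |f| <= 1/(2T)."""
    alpha = np.pi * T * (p - 1.0)
    e = np.exp(-np.pi * (p - 1.0) / 2.0)
    num = alpha * (1.0 - e * np.cos(np.pi * t / T)) + 2.0 * np.pi * t * e * np.sin(np.pi * t / T)
    return 2.0 * num / (alpha**2 + (2.0 * np.pi * t) ** 2)

dt = 0.02
t = np.arange(-12.0, 12.0 + dt, dt)      # integration grid; x(t) is negligible at the ends
k = np.arange(-500, 501)                 # truncated set of sample indices

# Noiseless spectrometer samples x_k = integral of x(t) a(t - kT) dt.
x_k = (x(t)[None, :] * a(t[None, :] - k[:, None] * T)).sum(axis=1) * dt

def shrunk_estimate(j):
    """Reconstructed sample T * sum_k x_k h(pjT - kT)."""
    return T * np.sum(x_k * h(p * j * T - k * T))

def shrunk_direct(j):
    """Direct sample of the shrunk pulse: integral of x(pt) a(t - jT) dt."""
    return np.sum(x(p * t) * a(t - j * T)) * dt

max_err = max(abs(shrunk_estimate(j) - shrunk_direct(j)) for j in range(-3, 4))
print(f"largest difference over j = -3..3: {max_err:.1e}")
```

Under these assumptions the two sets of samples should agree to within the truncation and integration errors of the sketch, which are far below the sample values themselves.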


IV. DISTORTION IN ESTIMATE OF SHRUNK SPECTRUM 


Suppose now that x(t) is not strictly bandlimited as in (18), but
merely that X(f) falls off rapidly for |f| > 1/(2T). Suppose further
that we estimate the shrunk spectrum by the reconstruction filter of
(23) with W = 1/(2T) (recall that W is no longer the bandwidth of
x(t)). There will now be linear distortion and aliasing in the estimate.
The principal aliasing will come from the first-order modulation
products at the sampler output in Fig. 1. Assume that the added noise
(9) is zero, N = 0. Then (20), (21), and (22) become, respectively:


$$x_r(pkT) = x_{pk} + d(pkT), \qquad (25)$$

$$X_r(f) = X(f)\, A(-pf) + \left[ X\!\left(f - \frac{1}{T}\right) A\!\left(-f + \frac{1}{T}\right) + X\!\left(f + \frac{1}{T}\right) A\!\left(-f - \frac{1}{T}\right) \right] \frac{A(-pf)}{A(-f)}, \quad |f| \le \frac{1}{2T}; \qquad X_r(f) = 0, \quad |f| > \frac{1}{2T}, \qquad (26)$$

$$X_r\!\left(\frac{f}{p}\right) = A(-f)\, X\!\left(\frac{f}{p}\right) + D\!\left(\frac{f}{p}\right). \qquad (27)$$

If we compare (26) and (27), the distortion D(f) is given by

$$D(f) = \left[ X\!\left(f - \frac{1}{T}\right) A\!\left(-f + \frac{1}{T}\right) + X\!\left(f + \frac{1}{T}\right) A\!\left(-f - \frac{1}{T}\right) \right] \frac{A(-pf)}{A(-f)}, \quad |f| \le \frac{1}{2T}; \qquad D(f) = -A(-pf)\, X(f), \quad |f| > \frac{1}{2T}. \qquad (28)$$


The inverse transform of (28), d(t), determines the distortion in the
reconstructed samples of the shrunk spectrum (25). The noise remains
as given by (24).

If X(f) is strictly bandlimited (18) to |f| < 1/(2T), d(t) = 0, and
we have perfect reconstruction of the shrunk spectrum (Section III)
in the absence of noise.


V. SUMMARY OF RESULTS 


An input spectrum x(t) is analyzed by a channel-bank spectrometer,
producing spectral samples $x_k$ (4) with additive noise samples $n_k$,
(8) through (10).

A shrunk spectrum x(pt), 1 < p, will similarly produce spectral
samples $x_{pk}$, with the same noise samples $n_k$.

We estimate $x_{pk}$ by y(pkT), with y(t) given by (11) with h(t) the
inverse transform of (23). Then:

$$ y(pkT) = x_{pk} + n_k + d(pkT). \tag{29} $$


In the absence of noise and distortion, the second and third terms
of (29) are zero, and y(pkT) gives perfect results for the spectral
samples $x_{pk}$ of the shrunk spectrum x(pt). Zero distortion requires
that X(f) be bandlimited (18), i.e., X(f) = 0, |f| > 1/(2T).

The noise power $\langle n^2 \rangle$ is given by (24).

The distortion d(t) is approximately the inverse Fourier transform
of (28). This may be evaluated if we know the input spectrum x(t).
However, normally this is what we are trying to measure; d(t) cannot
be determined merely from the spectral samples $x_k$, even in the
absence of noise. Consequently, (28) is of value only for calculating
the distortion in representative cases, not for removing its effects
in a given set of measurements.


VI. SINGLE-POLE FILTER

Assume the channel-bank filters are single pole. For this
characteristic the above results are readily written out in closed
form. Then a(t) of (4) and its Fourier transform A(f) are:

$$ a(t) = \frac{1}{1 + (t/t_c)^2} \tag{30} $$

and

$$ A(f) = \pi t_c\, e^{-2\pi t_c |f|}. \tag{31} $$

The 3-dB half-width of the filter is $t_c$. Assume the spacing between
adjacent filters is such that their transfer functions coincide at the
3-dB points:

$$ T = 2t_c. \tag{32} $$
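The noise sample correlation is

$$ \phi_m = \frac{1}{1 + m^2}. \tag{33} $$

The power spectrum of the added noise (Fig. 1) is

$$ P_n(f) = NT \sum_{m=-\infty}^{\infty} \frac{e^{-j2\pi mTf}}{1 + m^2}, \quad |f| < \frac{1}{2T}, $$

$$ P_n(f) = 0, \quad |f| > \frac{1}{2T}. \tag{34} $$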
The transfer function of the reconstruction filter is (23)

$$ H(f) = e^{-\pi T(p-1)|f|}, \quad |f| \le \frac{1}{2T}, $$

$$ H(f) = 0, \quad |f| > \frac{1}{2T}. \tag{35} $$

The impulse response of the reconstruction filter is

$$ h(t) = 2\, \frac{\pi T(p-1)\left[1 - e^{-\frac{\pi}{2}(p-1)} \cos\frac{\pi t}{T}\right] + 2\pi t\, e^{-\frac{\pi}{2}(p-1)} \sin\frac{\pi t}{T}}{[\pi T(p-1)]^2 + (2\pi t)^2}. \tag{36} $$

Observe that as p → 1, h(t) → [sin π(t/T)]/(πt).
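As a rough numerical check, the closed form for h(t) in the single-pole case can be compared with a direct inverse transform of H(f) = exp(-πT(p-1)|f|) over |f| ≤ 1/(2T). The sketch below is only an illustration; the function names, the midpoint rule, and the choice T = 1 are assumptions made here and are not part of the original analysis.

```python
import math

def h_closed(t, p, T=1.0):
    """Closed-form impulse response for H(f) = exp(-pi*T*(p-1)*|f|), |f| <= 1/(2T)."""
    a = math.pi * T * (p - 1.0)                 # decay rate of H(f) in f
    e = math.exp(-math.pi * (p - 1.0) / 2.0)    # value of H at the band edge
    x = math.pi * t / T
    num = a * (1.0 - e * math.cos(x)) + 2.0 * math.pi * t * e * math.sin(x)
    return 2.0 * num / (a * a + (2.0 * math.pi * t) ** 2)

def h_numeric(t, p, T=1.0, n=20000):
    """Midpoint-rule inverse transform of H(f); uses the even symmetry of H."""
    a = math.pi * T * (p - 1.0)
    df = (0.5 / T) / n
    total = 0.0
    for i in range(n):
        f = (i + 0.5) * df
        total += math.exp(-a * f) * math.cos(2.0 * math.pi * f * t)
    return 2.0 * total * df

if __name__ == "__main__":
    for t in (0.0, 0.3, 1.0, 2.7):
        print(t, h_closed(t, 1.1), h_numeric(t, 1.1))
```

For p very close to 1 the closed form approaches sin(πt/T)/(πt), which is the ideal low-pass interpolator.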
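The Fourier transform of the distortion d(t) is (28):

$$ D(f) = \frac{\pi T}{2}\, e^{-\pi} e^{\pi T p f} \left[ X\!\left(f - \frac{1}{T}\right) + X\!\left(f + \frac{1}{T}\right) e^{-2\pi T f} \right], \quad -\frac{1}{2T} < f < 0, $$

$$ D(f) = \frac{\pi T}{2}\, e^{-\pi} e^{-\pi T p f} \left[ X\!\left(f - \frac{1}{T}\right) e^{2\pi T f} + X\!\left(f + \frac{1}{T}\right) \right], \quad 0 < f < \frac{1}{2T}, \tag{37} $$

$$ D(f) = -\frac{\pi T}{2}\, e^{-\pi T p |f|} X(f), \quad |f| > \frac{1}{2T}. $$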


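For x(t) essentially bandlimited, X(f) will fall off rapidly for
|f| > 1/(2T), and most of the contribution to D(f) occurs for |f| near
1/(2T). In this region we may approximate D(f) by exponentials.

Finally, the output sample noise is

$$ \langle n^2 \rangle = \frac{N(p-1)}{\pi} \sum_{m=-\infty}^{\infty} \frac{1}{1 + m^2} \cdot \frac{1 - (-1)^m e^{-\pi(p-1)}}{(p-1)^2 + m^2} $$

$$ \phantom{\langle n^2 \rangle} = \frac{N}{1 - (p-1)^2} \left[ 1 - (p-1)\, \frac{\cosh\pi - e^{-\pi(p-1)}}{\sinh\pi} \right]. \tag{38} $$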


Suppose we approximate $P_n(f)$ of (34) by its m = 0 term, i.e., regard
$P_n(f)$ as white or equivalently neglect correlation between different
noise samples (10). This corresponds to keeping only the m = 0 term
in the output sample noise, in the first line of (38):

$$ \langle n^2 \rangle = \frac{N\left[1 - e^{-\pi(p-1)}\right]}{\pi(p-1)}, \quad P_n(f) \text{ white}. \tag{39} $$

As p → 1, (39) and (38) both yield $\langle n^2 \rangle$ → N. This suggests
that for expansion factors only a little larger than unity, the
correlation between noise samples may be neglected.
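The relation between the correlated-noise result (38) and the white-noise approximation (39) can be explored numerically. The following sketch works in units of N and assumes the single-pole correlation 1/(1 + m²) of (33). The closed form used in `noise_closed` follows from standard partial-fraction sums over m and is included here only as a cross-check; the function names and the truncation length are illustrative choices.

```python
import math

def noise_series(p, terms=2000):
    """First line of (38) in units of N, truncated to |m| <= terms."""
    c = p - 1.0
    e = math.exp(-math.pi * c)
    total = 0.0
    for m in range(-terms, terms + 1):
        sign = -1.0 if m % 2 else 1.0           # (-1)**m
        total += (1.0 - sign * e) / ((1.0 + m * m) * (c * c + m * m))
    return c / math.pi * total

def noise_closed(p):
    """Closed-form sum of the series above, in units of N."""
    c = p - 1.0
    ratio = (math.cosh(math.pi) - math.exp(-math.pi * c)) / math.sinh(math.pi)
    return (1.0 - c * ratio) / (1.0 - c * c)

def noise_white(p):
    """m = 0 term only, eq. (39), in units of N."""
    c = p - 1.0
    return (1.0 - math.exp(-math.pi * c)) / (math.pi * c)

if __name__ == "__main__":
    for p in (1.01, 1.05, 1.1):
        print(p, noise_series(p), noise_closed(p), noise_white(p))
```

Both expressions tend to N as p approaches 1, which is the behavior the text relies on when it neglects the correlation between noise samples.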

The single-pole filter is special in that most of the general results 
may be written out explicitly. In contrast, approximations are used in 
the treatment of more general filters, such as the double-pole filter 
treated in the following section. 
VII. DOUBLE-POLE FILTER


We now consider maximally flat double-pole channel-bank filters.
a(t) of (4) and its Fourier transform A(f) are:

$$ a(t) = \frac{1}{1 + (t/t_c)^4} \tag{40} $$

$$ A(f) = \frac{\pi t_c}{\sqrt{2}}\, e^{-\sqrt{2}\pi t_c |f|} \left[ \cos\!\left(\sqrt{2}\pi t_c f\right) + \sin\!\left(\sqrt{2}\pi t_c |f|\right) \right]. \tag{41} $$

$t_c$ is the 3-dB half-width of the filter. We again assume that the
spacing between adjacent filters is such that their transfer functions
coincide at the 3-dB points:

$$ T = 2t_c. \tag{42} $$


The calculation of $\phi_m$ of (10) is straightforward but messy. Rather
than perform it here, we rely on the example of the single-pole filter,
treated in the preceding section, which suggests that correlation
between noise samples is unimportant. Consequently, we take
$\phi_m \approx 0$, m ≠ 0, thus approximating the noise of (13) and Fig. 1
as white; note that $\phi_0 = 1$.

The transfer function H(f) of the reconstruction filter is determined 
by substituting (41) into (23). Unlike the single-pole filter of the 
preceding section, in the present case the inverse transform, yielding 
the reconstruction filter impulse response h(t), is not readily calculated 
exactly. Consequently, we must either invert H(f) numerically for 
each value of p, or use an approximation valid for p in the range of 
interest, here a little larger than one. 

First, we note that since |f| < 1/(2T), the maximum value of the
argument (in exponential and trigonometric functions) that can occur
in (41) and (42) is π/(2√2) = 1.111. Next, the first zero of (41) and
(42) occurs when the argument is 3π/4 ≈ 2.356. Consequently, a zero
of (41) and (42) never occurs in the range of interest, and
H(f) of (23) will never have a pole.

H(f) decreases monotonically as f varies from 0 to 1/(2T), and
remains positive for 0 < p < 2. This suggests a parabolic
approximation:

$$ H(f) \approx 1 - K_p (2Tf)^2, \quad |f| < \frac{1}{2T}, $$

$$ H(f) = 0, \quad |f| > \frac{1}{2T}. \tag{43} $$


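For the Taylor series approximation, $K_p = (\pi^2/8)(p^2 - 1)$.
Alternatively, we may approximate H(f) by a parabola that agrees
exactly at f = 0 and f = ±1/(2T). For p = 1.05 the agreement is better
than 0.3 percent, while for p = 1.1 the agreement is better than 0.75
percent. For this approximation

$$ K_p = 1 - \frac{e^{-\frac{\pi}{2\sqrt{2}} p} \left( \cos\frac{\pi p}{2\sqrt{2}} + \sin\frac{\pi p}{2\sqrt{2}} \right)}{e^{-\frac{\pi}{2\sqrt{2}}} \left( \cos\frac{\pi}{2\sqrt{2}} + \sin\frac{\pi}{2\sqrt{2}} \right)}. \tag{44} $$

The corresponding filter impulse response is in either case

$$ h(t) = \frac{\sin\frac{\pi t}{T}}{\pi t} - \frac{K_p}{T} \left[ \frac{\sin\frac{\pi t}{T}}{\frac{\pi t}{T}} + \frac{2\cos\frac{\pi t}{T}}{\left(\frac{\pi t}{T}\right)^2} - \frac{2\sin\frac{\pi t}{T}}{\left(\frac{\pi t}{T}\right)^3} \right]. \tag{45} $$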
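The two choices of the parabola coefficient can be compared numerically against the exact ratio H(f) = A(pf)/A(f) for the double-pole characteristic. The sketch below is illustrative only: it assumes T = 2t_c as in (42), uses the normalized frequency u = 2Tf, and the helper names (`g`, `H_exact`, `max_dev`) are hypothetical.

```python
import math

# Argument of the exponential and trigonometric terms of A(f) at |f| = 1/(2T).
THETA_EDGE = math.pi / (2.0 * math.sqrt(2.0))

def g(theta):
    """Shape of A(f) up to a constant factor, as a function of its argument."""
    return math.exp(-theta) * (math.cos(theta) + math.sin(theta))

def H_exact(u, p):
    """Exact reconstruction filter A(pf)/A(f) at normalized frequency u = 2Tf."""
    return g(p * THETA_EDGE * u) / g(THETA_EDGE * u)

def K_taylor(p):
    """Coefficient from the Taylor series about f = 0."""
    return (math.pi ** 2 / 8.0) * (p * p - 1.0)

def K_endpoint(p):
    """Coefficient that matches H exactly at f = 0 and |f| = 1/(2T)."""
    return 1.0 - H_exact(1.0, p)

def max_dev(p, K, n=1000):
    """Largest absolute gap between H_exact and 1 - K*u**2 on 0 <= u <= 1."""
    return max(abs(H_exact(i / n, p) - (1.0 - K * (i / n) ** 2))
               for i in range(n + 1))

if __name__ == "__main__":
    for p in (1.05, 1.1):
        print(p, K_taylor(p), K_endpoint(p),
              max_dev(p, K_taylor(p)), max_dev(p, K_endpoint(p)))
```

The endpoint-matched parabola stays within a fraction of a percent of the exact filter across the band, whereas the Taylor parabola drifts away from it toward the band edge.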

The distortion transform D(f) for the ideal case is again given by
substituting (41) and (42) into (28). A similar expression is readily
written out for the approximate transfer function (43), which yields
additional distortion.

The output sample noise, neglecting correlation between noise
samples as discussed in the paragraph following (42), with the
approximate transfer function of (43), is

$$ \langle n^2 \rangle = N(1 - K_p). \tag{46} $$

VIII. EXAMPLE: DOUBLE-POLE FILTER, LORENTZ SPECTRUM 


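Assume a Lorentz spectrum as input:

$$ x(t) = \frac{1}{1 + (t/t_L)^2}. \tag{47} $$

Assume the channel-bank filter characteristic is given by (40), with
separation given by (42). The samples $x_k$ (4) are

$$ x_k = \frac{\pi t_L}{2} \left[ \frac{2 + 2\sqrt{2}\left(\frac{t_L}{T} + k\right)}{\left(1 + 2\sqrt{2}\,\frac{t_L}{T}\right)^2 + \left(1 + 2\sqrt{2}\,k\right)^2} + \frac{2 + 2\sqrt{2}\left(\frac{t_L}{T} - k\right)}{\left(1 + 2\sqrt{2}\,\frac{t_L}{T}\right)^2 + \left(1 - 2\sqrt{2}\,k\right)^2} \right]. \tag{48} $$

For line width large compared to the filter spacing, $t_L \gg T$,

$$ \lim_{t_L/T \to \infty} x_k = \frac{\pi T}{2\sqrt{2}} \cdot \frac{1}{1 + \left(\frac{kT}{t_L}\right)^2}. \tag{49} $$

The samples $x_{pk}$ (16) are found by substituting $t_L \to t_L/p$ in (48).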
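The spectral samples for this example can be checked by brute force: convolve the Lorentz line (47) with the double-pole characteristic (40) numerically and compare with a closed form. In the sketch below the closed form in `sample_closed` was obtained here by multiplying the transforms of (47) and (40) and inverting; it is offered as a cross-check under that assumption, and all function names and integration settings are illustrative.

```python
import math

def lorentz(t, tL):
    """Lorentz line of half-width tL."""
    return 1.0 / (1.0 + (t / tL) ** 2)

def a_double(t, tc):
    """Maximally flat double-pole filter shape of half-width tc."""
    return 1.0 / (1.0 + (t / tc) ** 4)

def sample_closed(k, T, tL):
    """Closed-form sample x_k for filter spacing T = 2*tc (assumed derivation)."""
    r = 2.0 * math.sqrt(2.0) * tL / T
    q = 2.0 * math.sqrt(2.0) * k
    d = (1.0 + r) ** 2
    return 0.5 * math.pi * tL * ((2.0 + r + q) / (d + (1.0 + q) ** 2)
                                 + (2.0 + r - q) / (d + (1.0 - q) ** 2))

def sample_numeric(k, T, tL, half_span=200.0, step=0.005):
    """Trapezoidal evaluation of the convolution integral centered on kT."""
    tc = T / 2.0
    n = int(round(2.0 * half_span / step))
    total = 0.0
    for i in range(n + 1):
        u = -half_span + i * step                # offset from the filter center
        w = 0.5 if i in (0, n) else 1.0          # trapezoid end weights
        total += w * lorentz(k * T + u, tL) * a_double(u, tc)
    return total * step

if __name__ == "__main__":
    for k in (0, 1, 3):
        print(k, sample_closed(k, 1.0, 5.0), sample_numeric(k, 1.0, 5.0))
```

With t_L/T = 5 the peak sample comes out close to 1.10 T, and for very wide lines the samples approach πT/(2√2) times the Lorentz shape.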
The transform D(f) of the distortion d(t) of (25) is given as follows:

$$ D(f) = A(-f)X(f) \left[ H(f) - \frac{A(-pf)}{A(-f)} \right] + \left[ X\!\left(f - \frac{1}{T}\right) A\!\left(-f + \frac{1}{T}\right) + X\!\left(f + \frac{1}{T}\right) A\!\left(-f - \frac{1}{T}\right) \right] H(f), \quad |f| < \frac{1}{2T}, $$

$$ D(f) = -A(-pf)X(f), \quad |f| > \frac{1}{2T}. \tag{50} $$

Here A(f) is given by (41) and (42), and X(f) is the transform of (47):

$$ X(f) = \pi t_L\, e^{-2\pi t_L |f|}. \tag{51} $$

For H(f) the ideal reconstruction filter of (23), (41), and (42), (50)
yields (28). The approximate H(f) of (43) yields additional distortion.
Various approximations are necessary in different cases to evaluate
d(t) as the inverse transform of (50).

As a single illustration, assume a Lorentz spectrum with half-width
equal to five times the filter spacing:

$$ \frac{t_L}{T} = 5. \tag{52} $$

Assume a shrink factor

$$ p = 1.1. \tag{53} $$

Figure 2 shows the transform of the desired output; Fig. 3 shows the
transform of the distortion with the ideal reconstruction filter (23),
(41), and (42); and Fig. 4 shows the transform of the distortion with
the Taylor series approximate filter of (43) with
$K_p = (\pi^2/8)(p^2 - 1)$. Note that the transforms in Figs. 2, 3, and 4
are normalized by $(\pi T)^2$.

For the ideal reconstruction filter, the principal contribution to
distortion occurs near |f| = 1/(2T). The form of D(f) in Fig. 3 in this
region suggests that we approximate D(f) by exponentials; this yields
a closed-form approximation for d(t), and thence for the error d(pkT)
in the reconstructed samples (29). Similarly, for the Taylor series
approximate filter the form of D(f) in Fig. 4 suggests a Gaussian
approximation for D(f), similar to that used in Laplace's method, to
obtain an approximation for d(t).

Fig. 2—Desired output for the Lorentz spectrum: $X(f)A(-pf)/(\pi T)^2$ versus 2Tf, for p = 1.1 (shrink factor) and $t_L/T$ = 5 (ratio of spectral half-width to filter spacing).

Fig. 4—Distortion with Taylor series filter for the Lorentz spectrum.

We shall not carry out either of these approximations here; rather,
we will content ourselves with simple upper bounds on the maximum
distortion. Clearly,

$$ d_{\max} = |d(t)|_{\max} \le \int_{-\infty}^{\infty} |D(f)|\, df. \tag{54} $$

For the ideal filter, we estimate from Fig. 3 that

$$ d_{\max} \lesssim 5 \times 10^{-8}\, (\pi^2/2)\, T \approx 2.5 \times 10^{-7}\, T. $$

This estimate will be too large; $|d(t)|_{\max}$ will occur for t away from
the origin, and consequently the maximum error will occur at the edge
of the spectrum in (29).

For the Taylor series approximate filter, we estimate from Fig. 4
that $d_{\max} \lesssim 1.5 \times 10^{-4}\, (\pi^2/2)\, T \approx 7.5 \times 10^{-4}\, T$.
This estimate will be fairly accurate; $|d(t)|_{\max}$ occurs for t = 0,
and consequently the maximum error occurs at the center of the
spectrum, k = 0 in (29).

Finally, the noise in the reconstructed samples is given
approximately for the Taylor series filter by (46) as

$$ \langle n^2 \rangle = N \left[ 1 - \frac{\pi^2}{8}(p^2 - 1) \right] = 0.741N. \tag{55} $$

The noise for the ideal filter will be a little larger.


IX. “SQUISH” 


We compute the distortion and noise in the current algorithm
“Squish”, for comparison with the results of the preceding section.
Squish may be described as follows. Associate with the samples $x_k$ of
(4) the function

$$ s(t) = \sum_{k=-\infty}^{\infty} x_k\, \mathrm{rect}\!\left(\frac{t}{T} - k\right), \tag{56} $$

where we recall that

$$ \mathrm{rect}(t) = \begin{cases} 1, & |t| < 1/2 \\ 0, & |t| > 1/2. \end{cases} \tag{57} $$

Similarly, define

$$ s_q(t) = \sum_{k=-\infty}^{\infty} q_k\, \mathrm{rect}\!\left(\frac{t}{pT} - k\right). \tag{58} $$

Choose the $\{q_k\}$ such that

$$ \int_{-\infty}^{\infty} \mathrm{rect}\!\left(\frac{t - kpT}{pT}\right) [s_q(t) - s(t)]\, dt = 0; \tag{59} $$

equivalently,

$$ q_k = \frac{1}{pT} \int_{-\infty}^{\infty} \mathrm{rect}\!\left(\frac{t - kpT}{pT}\right) s(t)\, dt. \tag{60} $$

The $q_k$ are assumed to approximate the samples $x_{pk}$ of (16):

$$ q_k = x_{pk} + d_k + n_k, \tag{61} $$

where $d_k$ and $n_k$ represent the distortion and noise from the desired
output.

The following results correspond to the parameters of (52) and
(53).

     k        d_k                <n_k^2>
     0     -1.85 x 10^-3 T       0.83 N
     1     -4.87 x 10^-3 T       0.76 N
     2     -4.47 x 10^-3 T       0.65 N
     3     -1.59 x 10^-3 T       0.57 N
     4      0.91 x 10^-3 T       0.52 N
     5      2.18 x 10^-3 T       0.5 N          (62)
     6      2.51 x 10^-3 T       0.52 N
     7      2.36 x 10^-3 T       0.57 N
     8      2.02 x 10^-3 T       0.65 N
     9      1.65 x 10^-3 T       0.76 N
    10      0.12 x 10^-3 T       0.83 N

The peak distortion $d_{\max} = |d_k|_{\max} = 4.87 \times 10^{-3}\, T$ occurs at
k = 1. The peak noise $\langle n_k^2 \rangle_{\max}$ = 0.83 N occurs at
k = 0, 10, ..., i.e., when pk = integer; the variation of noise with k
makes evident the nonstationary nature of “Squish”.
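The Squish averaging rule can be tried out directly. The sketch below is a minimal illustration, assuming T = 1, the parameters p = 1.1 and t_L/T = 5, noise samples that are uncorrelated with equal power N, and a closed form for the double-pole samples of a Lorentz line that was derived here from the transforms of the line and filter shapes. All names are hypothetical.

```python
import math

def sample(k, tL, T=1.0):
    """Double-pole channel-bank sample of a Lorentz line (assumed closed form)."""
    r = 2.0 * math.sqrt(2.0) * tL / T
    q = 2.0 * math.sqrt(2.0) * k
    d = (1.0 + r) ** 2
    return 0.5 * math.pi * tL * ((2.0 + r + q) / (d + (1.0 + q) ** 2)
                                 + (2.0 + r - q) / (d + (1.0 - q) ** 2))

def squish_weights(k, p):
    """Weights of the input cells averaged into output cell k (window width p, T = 1)."""
    lo, hi = (k - 0.5) * p, (k + 0.5) * p
    weights = {}
    j = math.floor(lo + 0.5)                     # input cell containing the left edge
    while j - 0.5 < hi:
        overlap = min(hi, j + 0.5) - max(lo, j - 0.5)
        if overlap > 0.0:
            weights[j] = overlap / p
        j += 1
    return weights

def squish_distortion(k, p, tL):
    """d_k = q_k - x_pk, with x_pk from the line shrunk to half-width tL/p."""
    q_k = sum(w * sample(j, tL) for j, w in squish_weights(k, p).items())
    return q_k - sample(k, tL / p)

def squish_noise(k, p):
    """Output noise power in units of N for uncorrelated input noise samples."""
    return sum(w * w for w in squish_weights(k, p).values())

if __name__ == "__main__":
    for k in range(11):
        print(k, squish_distortion(k, 1.1, 5.0), squish_noise(k, 1.1))
```

Under these assumptions the printed columns track the tabulated distortion and noise, including the periodic variation of the noise with k.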


X. DISCUSSION 


We have shown how to relate the measurements obtained with a
channel-bank spectrometer on two spectra, $P_s(f)$ and $P_s(pf)$, having
similar shape but different scales along the frequency axis. This
problem arises in radio astronomy, in determining whether two
isotopes have the same velocity distribution.

As a single numerical example, we have considered a Lorentz line
10 samples wide between its half-power points, analyzed by a
channel-bank spectrometer with double-pole filters spaced so that
their 3-dB points coincide. We have determined the noise and
distortion in computing the corresponding samples for a Lorentz
spectrum 90.9 percent as wide, by the present methods and with the
current algorithm “Squish”. We summarize these results of Sections
VIII and IX in Table I.


Table I—Comparison between present and proposed processing methods

                          Ideal Filter        Taylor Series Filter    Squish
    Maximum distortion    < 2.5 x 10^-7 T     7.5 x 10^-4 T           4.87 x 10^-3 T
    Maximum noise power   > 0.741 N           0.741 N                 0.83 N


These values are to be compared with the maximum sample value,
$x_0 = 1.10\, T$, obtained from (48) with k = 0.

In the present example Squish has a little more distortion and noise
than the Taylor series approximation to the ideal reconstruction
filter; the ideal filter, of course, has much less distortion. For a
narrower input spectrum, i.e., larger p, Squish will be relatively
worse.

Such comparisons may vary widely, depending on the spectrum
under study and the channel-bank filter characteristic. The behavior
in the main part of the spectrum may be quite different from that far
out on the tails.
APPENDIX
Output Statistics of Channel-Bank Spectrometer


Figure 5 shows a typical section of a channel-bank spectrometer,
consisting of a filter, square-law detector, and integrator. The input
s(t) represents the noise whose spectral density $P_s(f)$ we wish to
determine; $P_s(f)$ is real and symmetric, i.e.,
$P_s(f) = P_s^*(f) = P_s(-f)$. v(t) represents the receiver input noise,
assumed white, with spectral density $\mathcal{N}$. Assume throughout this
appendix that the measurement starts at t = 0 and that the integration
time is $\mathcal{T}$; then the integrator output $w_k(\mathcal{T})$ is the
noisy sample of (8).

We assume the receiver input noise is much larger than the noise
whose spectrum is being measured:

$$ P_s(f) \ll \mathcal{N}. \tag{63} $$

The random “signal” s(t) and noise v(t) each produce a dc and a
random component at the outputs $w_k(t)$. The dc component of
$w_k(\mathcal{T})$ due to s(t) is the desired spectral sample. The dc
component due to receiver front-end noise v(t) is uninteresting here.
The random component due to s(t) is much smaller than the random
component due to v(t); we ignore the former, and consider the latter
as the “noise” in the sample. In summary, the dc component of
$w_k(\mathcal{T})$ due to s(t) (the noise being measured) corresponds to
$x_k$ of (4) and (8); the random component due to receiver noise v(t)
corresponds to $n_k$ of (8) and (9).

Fig. 5—Typical section of a channel-bank spectrometer.

Assume the channel-bank filters are equally spaced with the same
shape; let B(f) be their common baseband equivalent filter. Then the
kth channel-bank filter has transfer function

$$ B_k(f) = B(f - kF) + B^*(-f - kF), \tag{64} $$

where F represents the filter spacing. We assume that B(f), and hence
$B_k(f)$, are causal; thus their respective inverse Fourier transforms
satisfy b(t) = 0 and $b_k(t)$ = 0 for t < 0. $b_k(t)$ is real, but b(t) is
not necessarily real; while $B_k(f)$ is Hermitian, B(f) need not be. We
also assume that the spectra of interest are narrowband, so that the
{k} of interest in (64) are large enough such that

$$ B(f) = 0, \quad |f| > |k|F, \tag{65} $$

and the two terms in (64) are essentially nonoverlapping.
Assume that v(t) = 0 in Fig. 5. Then

$$ \langle w_k(t) \rangle = \langle z_k(t) \rangle = \langle y_k^2(t) \rangle = \int_{-\infty}^{\infty} |B_k(f)|^2 P_s(f)\, df. \tag{66} $$

From (64) and (65), we have in terms of the equivalent baseband filter
transfer function

$$ \langle w_k(\mathcal{T}) \rangle = \int_{-\infty}^{\infty} 2|B(f - kF)|^2 P_s(f)\, df \tag{67} $$

for the desired sample output. Since k and F are positive, by (65) the
main contribution to the integral in (67) occurs close to f ≈ kF, and
there is no significant contribution to this integral for f < 0. Setting
f → t, F → T, $P_s(\cdot)$ → x(·), $2|B(\cdot)|^2$ → a(·), and
$\langle w_k(\mathcal{T}) \rangle$ → $x_k$, we obtain (4).

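Assume now that s(t) = 0 in Fig. 5. Then

$$ \langle w_k(t) \rangle = \langle z_k(t) \rangle = \langle y_k^2(t) \rangle = \mathcal{N} \int_{-\infty}^{\infty} |B_k(f)|^2\, df = \mathcal{N} \int_{-\infty}^{\infty} 2|B(f)|^2\, df, \tag{68} $$

the last step following from (64) and (65). Define

$$ w_{k\,ac}(t) = w_k(t) - \langle w_k(t) \rangle, \qquad z_{k\,ac}(t) = z_k(t) - \langle z_k(t) \rangle. \tag{69} $$

Then the spectral density of the second quantity in (69) is

$$ P_{z_{k\,ac}}(f) = 2\mathcal{N}^2\, |B_k(f)|^2 \circledast |B_k(f)|^2, \tag{70} $$

where ⊛ represents convolution. Now the integrator has impulse
response

$$ i(t) = \begin{cases} \dfrac{1}{\mathcal{T}}, & 0 < t < \mathcal{T} \\ 0, & \text{otherwise.} \end{cases} \tag{71} $$

The integrator transfer function, the Fourier transform of (71), is

$$ I(f) = e^{-j\pi f \mathcal{T}}\, \frac{\sin \pi f \mathcal{T}}{\pi f \mathcal{T}}. \tag{72} $$

Then

$$ \langle w_{k\,ac}^2(t) \rangle = \int_{-\infty}^{\infty} |I(f)|^2 P_{z_{k\,ac}}(f)\, df. \tag{73} $$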


Now by (64), $P_{z_{k\,ac}}(f)$ of (70) consists of a low-frequency part
centered around f = 0,

$$ P_{z_{k\,ac}}^{LF}(f) = 4\mathcal{N}^2\, |B(f)|^2 \circledast |B(-f)|^2, \tag{74} $$

and a high-frequency part centered around f = ±2kF. By (65) these
two parts are essentially nonoverlapping. Since $|I(f)|^2$ becomes small
for |f| ≫ 1/$\mathcal{T}$, only the portion (74) will be significant in
(73) if 1/$\mathcal{T}$ ≪ |k|F, i.e., if the integration time is large
compared to the reciprocal frequency being measured. This is always
true, and hence we may replace $P_{z_{k\,ac}}(f)$ in (73) by (74).
Moreover, we assume

$$ \frac{1}{\mathcal{T}} \ll \text{width of } B(f). \tag{75} $$

Then (73) and (74) yield

$$ \langle w_{k\,ac}^2(t) \rangle = \frac{\mathcal{N}^2}{\mathcal{T}} \int_{-\infty}^{\infty} \left[ 2|B(f)|^2 \right]^2 df. \tag{76} $$

Setting f → t, $2|B(\cdot)|^2$ → a(·), and
$\langle w_{k\,ac}^2(\mathcal{T}) \rangle$ → N, (76) becomes

$$ N = \frac{\mathcal{N}^2}{\mathcal{T}} \int_{-\infty}^{\infty} a^2(t)\, dt \tag{77} $$

to be substituted in (9) and (10).
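Finally, we find the correlation between different noise samples
(10); i.e., in the notation of the present appendix we seek

$$ \langle w_{k\,ac}(\mathcal{T})\, w_{k'ac}(\mathcal{T}) \rangle. $$

We set s(t) = 0, and have the following relations between
cross-spectra:

$$ P_{y_k y_{k'}}(f) = \mathcal{N}\, B_k(f) B_{k'}^*(f). \tag{78} $$

$$ P_{w_{k\,ac} w_{k'ac}}(f) = \left[ \frac{\sin \pi f \mathcal{T}}{\pi f \mathcal{T}} \right]^2 P_{z_{k\,ac} z_{k'ac}}(f). \tag{79} $$

We relate $P_{z_{k\,ac} z_{k'ac}}(f)$ to $P_{y_k y_{k'}}(f)$ as follows:

$$ R_{z_{k\,ac} z_{k'ac}}(\tau) = \langle z_{k\,ac}(t + \tau)\, z_{k'ac}(t) \rangle = \langle y_k^2(t + \tau)\, y_{k'}^2(t) \rangle - \langle z(t) \rangle^2, \tag{80} $$

where we omit the subscript from the final term because the expected
detector outputs are the same for all channels by (68). Assuming the
receiver input noise v(t) is Gaussian, the $y_k(t)$ are jointly Gaussian,
and using (68)

$$ R_{z_{k\,ac} z_{k'ac}}(\tau) = 2 R_{y_k y_{k'}}^2(\tau). \tag{81} $$

The Fourier transform of (81) yields the following relationship between
the cross-spectra of different y and the ac components of corresponding
z:

$$ P_{z_{k\,ac} z_{k'ac}}(f) = 2 P_{y_k y_{k'}}(f) \circledast P_{y_k y_{k'}}(f). \tag{82} $$

Combining (78), (79), and (82) we get:

$$ P_{w_{k\,ac} w_{k'ac}}(f) = 2\mathcal{N}^2 \left[ \frac{\sin \pi f \mathcal{T}}{\pi f \mathcal{T}} \right]^2 \left[ B_k(f) B_{k'}^*(f) \right] \circledast \left[ B_k(f) B_{k'}^*(f) \right]. \tag{83} $$

Then

$$ \langle w_{k\,ac}(t)\, w_{k'ac}(t) \rangle = \int_{-\infty}^{\infty} P_{w_{k\,ac} w_{k'ac}}(f)\, df. \tag{84} $$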


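By (75) we may approximate the convolution in (83) by its value for
f = 0 in (84), yielding

$$ \langle w_{k\,ac}(t)\, w_{k'ac}(t) \rangle = \frac{2\mathcal{N}^2}{\mathcal{T}} \int_{-\infty}^{\infty} |B_k(f)|^2 |B_{k'}(f)|^2\, df. \tag{85} $$

From (64) and (65) we express (85) in terms of the baseband equivalent
filter as

$$ \langle w_{k\,ac}(t)\, w_{k'ac}(t) \rangle = \frac{\mathcal{N}^2}{\mathcal{T}} \int_{-\infty}^{\infty} 2|B(f)|^2 \cdot 2|B(f - (k' - k)F)|^2\, df. \tag{86} $$

Setting f → t, F → T, $2|B(\cdot)|^2$ → a(·),
$\langle w_{k\,ac}(\mathcal{T})\, w_{k'ac}(\mathcal{T}) \rangle$ → $\langle n_k n_{k'} \rangle$,
and k′ − k = m in (86), and using (77), we obtain (10).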
Weighting Strategies for Companded PCM
Transmitted Over Rayleigh Fading and Gaussian
Channels

We present three methods of weighting N-bit μ-law Pulse Code Modulation
(PCM) with binary modulation, and evaluate their performances theoretically
for transmission over Rayleigh fading and Gaussian channels. The digital
modulation methods considered are noncoherent frequency shift keying and
coherent phase shift keying. Ideal selection combining and ideal maximal ratio
combining diversity techniques are employed when the transmission is over
fading channels. Weighting System 1 is the conventional weighted pulse code
modulated system, where the bits in every PCM word have the same weighting
profile. Weighting System 2 has $2^N$ unique weighting profiles, while System 3
has an addendum to System 2, where every bit in a particular word is further
weighted by a unique multiplicative factor. For Rayleigh fading channels
and the encoder operating at an input level that provided maximum
signal-to-quantization noise ratio, we obtained gains in overall signal-to-noise
ratio (s/n) over unweighted μ-law PCM of 3, 4.5, and 6 dB for Systems 1, 2,
and 3, respectively. When the systems were used in conjunction with Gaussian
channels, the corresponding gains were 10, 12, and 17 dB, for a channel s/n
of 10 dB. In addition to the theoretical results, we conducted computer
simulations using four concatenated speech sentences transmitted via our
weighted μ-law PCM systems over mobile radio channels. The simulation
performances were in good agreement with our theoretical results, which were
based on input signals having an exponential distribution.


* University of Lund, Sweden. † AT&T Bell Laboratories. ‡ University of
Southampton, England.





Copyright © 1984 AT&T. Photo reproduction for noncommercial use is permitted with- 
out payment of royalty provided that each reproduction is done without alteration and 
that the Journal reference and copyright notice are included on the first page. The title 
and abstract, but no other portions, of this paper may be copied or distributed royalty 
free by computer-based and other information-service systems without further permis- 
sion. Permission to reproduce or republish any other portion of this paper must be 
obtained from the Editor. 




I. INTRODUCTION 


When analog signals are PCM-encoded into binary words, the bits 
in any word represent different contributions to the decoded analog 
sample.’ For example, in 8-bit piecewise μ-law Pulse Code Modulation
(PCM),” the first bit in each PCM word represents the polarity
of the speech sample, the next three bits identify the segment number, 
and the final four bits locate the value of the sample within the 
segment. Clearly, there is a hierarchical structure, with the most 
significant magnitude bit being the most influential in determining 
the accuracy of the recovered speech sample, and the importance of 
the subsequent bits declining rapidly until the least significant bit 
(LSB) is reached. 

Binary errors inevitably will occur to some extent in the regeneration 
process at the receiver, and therefore it seems reasonable to mitigate 
the effect of these errors by matching the energy of each bit prior to 
transmission in a manner dependent on their contribution to the 
accuracy of the recovered decoded sample. If this approach is adopted, 
the average energy used in the transmission of each binary word is 
arranged to be identical to that in conventional PCM, where all bits 
have the same energy. Thus the energies allocated to the most signif- 
icant magnitude bits are enhanced at the expense of the least signifi- 
cant bits whose energies are curtailed. 

This concept, due to Bedrosian,’ has an instant appeal as a technique 
for reducing the noise due to binary transmission errors in the re- 
covered speech signal. A PCM system that weights the binary levels 
by differing amounts, i.e., the binary amplitudes may be different for 
each bit in the PCM word, is known as weighted PCM. Sundberg” 
generalized the analysis of Bedrosian to include companded PCM, 
making use of the A factors, which he and Rydbeck conceived.** 

For transmission of unweighted PCM by means of a binary modu- 
lation scheme, the bit error probability is the same for all bits, 
independent of their position in the word. This is clearly not so for 
weighted PCM, where the weighting of the bit energies reduces the 
error probabilities of the more significant bits. As the average en- 
ergy (or power) per transmitted bit is constant, irrespective of 
whether weighting is applied, it follows that the error probabilities 
of the less significant bits must increase. However, the effect of 
weighting is to improve the overall signal-to-noise ratio (s/n) for a 
given channel s/n. 

Our intention in this discourse is to describe new weighting
strategies for μ-law PCM transmitted over Rayleigh fading and Gaussian
channels by means of either binary Noncoherent Frequency Shift
Keying (NCFSK) or binary Coherent Phase Shift Keying (CPSK)
modulation. For each weighting profile and channel we will present
formulae for the noise power in the recovered signal due to the effect
of transmission errors. The theoretical overall s/n is then presented
as a function of channel s/n and compared to simulated results using
speech signals. However, before we introduce our new weighting
methods we will briefly review the basic concepts of how transmission
errors in companded PCM manifest as noise in the recovered analog
signal.


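II. DIGITAL NOISE IN CONVENTIONAL COMPANDED PCM


In companded PCM the input sample x may be initially compressed
to

$$ y = f(x), \tag{1} $$

where a common choice of f(x) is the μ- or A-law.® These laws are
defined as follows. For μ-law PCM,

$$ f(x) = \begin{cases} \dfrac{\log(1 + \mu x)}{\log(1 + \mu)}, & 0 \le x \le 1 \\ -f(-x), & -1 \le x < 0, \end{cases} \tag{2} $$

and for A-law,

$$ f(x) = \begin{cases} \dfrac{Ax}{1 + \log A}, & 0 \le x \le \dfrac{1}{A} \\ \dfrac{1 + \log(Ax)}{1 + \log A}, & \dfrac{1}{A} \le x \le 1 \\ -f(-x), & -1 \le x < 0. \end{cases} \tag{3} $$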


In stating f(x) we have followed the practice”® of normalizing the
range of both x and y to the interval −1 to +1. The compressed sample
y is quantized to $y_i$, which can have one of $2^N$ possible values.
Encoding of $y_i$ into an N-bit binary word $L_i$ ensues, and the words are
generated at a rate in excess of the Nyquist rate. The stream of binary
words is suitably filtered and modulates a carrier for transmission.
Figure 1 is a block diagram of a companded PCM system, although the
input filter and sampler, and the final interpolating filter are not
displayed.
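The compression, quantization, and expansion steps can be sketched for the μ-law case as follows. This is a minimal illustration, not the encoder of the paper: the value μ = 255, the mid-rise uniform quantizer, and the function names are assumptions made here, and the continuous μ-law is used rather than a piecewise (segmented) approximation.

```python
import math

MU = 255.0  # assumed compression parameter; not specified in this section

def compress(x, mu=MU):
    """mu-law compressor f(x) on [-1, 1], odd-symmetric about the origin."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)

def expand(y, mu=MU):
    """Inverse of compress: recovers x from the compressed value y."""
    return math.copysign(math.expm1(abs(y) * math.log1p(mu)) / mu, y)

def quantize(y, n_bits=8):
    """Uniform quantizer on [-1, 1] with 2**n_bits levels (assumed mid-rise).

    Returns the level index i and the quantized value y_i.
    """
    levels = 2 ** n_bits
    step = 2.0 / levels
    i = min(levels - 1, int((y + 1.0) / step))
    return i, -1.0 + (i + 0.5) * step

if __name__ == "__main__":
    x = 0.3
    i, y_i = quantize(compress(x))
    print(i, y_i, expand(y_i))   # index, quantized level, recovered sample
```

The recovered sample differs from x only by the quantization error, which is what the encoder noise power measures when no transmission errors occur.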

The receiver demodulates the incoming signal and regenerates the
binary words. For the transmitted word $L_i$, the regenerated word is
$L_{i,l}$, where the subscript l signifies that one or more bits may be
erroneously produced. We may represent $L_{i,l}$ as the exclusive-OR
operation on the binary vectors $L_i$ and $e_l$, where $e_l$ is an N-bit
sequence whose subscript l identifies the error sequence. Thus, we may
express $L_{i,l}$ as

$$ L_{i,l} = L_i \oplus e_l. \tag{4} $$

Fig. 1—Block diagram of a conventional PCM system (filters and sampler not shown).

For example, the error sequence 00110000 means that the bits in the
third and fourth bit positions are erroneously regenerated. Observe
that there are $2^N - 1$ possible $e_l$ sequences. We do not consider the
all-zero $e_l$ sequence, as it implies no bit errors are present in the
regenerated word. The word $L_{i,l}$ is binary decoded to $y_{i,l}$ and the
recovered sample is

$$ x_{i,l} = f^{-1}(y_{i,l}). \tag{5} $$

If no transmission binary errors occur, the recovered sample is

$$ x_{i,l} = f^{-1}(y_i) = x_i \tag{6} $$

and the overall noise power reduces to the encoder noise power

$$ \varepsilon_e^2 = E\{(x - x_i)^2\}, \tag{7} $$

where the expected value is formed over the source statistics. However,
we are not concerned here with the noise power generated in the
encoder, but with the noise power in the recovered samples due to
transmission errors, which we call digital noise power. This power is

$$ \varepsilon^2 = E_{i,l}\{(x_i - x_{i,l})^2\}, \tag{8} $$

where the expectation is performed over all $2^N$ levels of $x_i$, and over
all $2^N - 1$ possible error sequences $e_l$. The set of $2^N - 1$ error
sequences $e_l$ may be subdivided into groups containing single, double,
..., N-bit errors per PCM word. These groups are described by the
Hamming weight, w, so that w = 1, 2, ..., N. The input signal and PCM
encoder are independent of the channel imperfections, enabling us to
express $\varepsilon^2$ as

$$ \varepsilon^2 = \sum_{l=1}^{2^N - 1} p_l\, E_i\{(x_i - x_{i,l})^2\} = \sum_{l=1}^{2^N - 1} p_l A_l, \tag{9} $$

where

$$ A_l \triangleq E_i\{(x_i - x_{i,l})^2\} \tag{10} $$

are the A-factors, and $p_l$ is the probability of occurrence of the lth
error sequence $e_l$, corresponding to the A-factor $A_l$. Observe that
$A_l$ is the average noise power in the output sequence due to the
existence of the specific error sequence $e_l$, the expectation being
made over all the values i, namely over the $2^N$ quantized levels.

When independent errors occur with average bit error probability
P, then the probability of a particular pattern of w errors in an
N-bit word is

$$ P_w = P^w (1 - P)^{N - w}. $$

Thus, when the error sequence $e_l$ has w bit errors in the N-bit word,
$p_l = P_w$. A case of particular interest is when no more than one error
occurs in any PCM word and P ≪ 1, whence $P_w \approx P$, and the digital
noise power becomes

$$ \varepsilon^2 = P \sum_{l=1}^{N} A_l, $$

where $A_1, A_2, \ldots, A_N$ are the single-error A-factors for errors in
bits 1 to N.
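The bookkeeping for error sequences can be illustrated with a few lines of code. The sketch below assumes independent bit errors with probability P, represents words and error sequences as tuples of 0/1 bits, and uses hypothetical function names; it checks that the probabilities of all nonzero error sequences sum to 1 − (1 − P)^N and that, for small P, the single-error sequences dominate.

```python
from itertools import product

def apply_error(word, e):
    """Regenerated word: bitwise exclusive-OR of the transmitted word and e."""
    return tuple(b ^ x for b, x in zip(word, e))

def sequence_probability(e, P):
    """Probability of one specific error sequence e with independent bit errors."""
    w = sum(e)                                   # Hamming weight of e
    return P ** w * (1.0 - P) ** (len(e) - w)

def error_sequences(n_bits):
    """All 2**n_bits - 1 nonzero error sequences."""
    return [e for e in product((0, 1), repeat=n_bits) if any(e)]

if __name__ == "__main__":
    word = (1, 0, 1, 1, 0, 1, 0, 0)
    e = (0, 0, 1, 1, 0, 0, 0, 0)                 # errors in bits 3 and 4
    print(apply_error(word, e))
    P = 1e-3
    total = sum(sequence_probability(s, P) for s in error_sequences(8))
    single = sum(sequence_probability(s, P)
                 for s in error_sequences(8) if sum(s) == 1)
    print(total, single)
```

For P = 0.001 and N = 8 nearly all of the error probability sits in the eight single-error sequences, which is the regime in which the single-error approximation is used.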


III. WEIGHTED PCM


The digital noise power $\varepsilon^2$ in a PCM system is determined using
eq. (9), and we now seek to weight all the bits in the transmitted
signal in order to reduce $\varepsilon^2$. We consider the case when single
bit errors occur in any PCM word, knowing that for more severe error
conditions the overall s/n is, in general, unacceptably low. Thus, for
this case we approximate eq. (9) as

$$ \varepsilon^2 \approx \sum_{l=1}^{N} p_l A_l. \tag{11} $$

We emphasize that consideration will be given only to binary
modulation schemes. From eq. (10),

$$ A_l = \sum_{i=0}^{2^N - 1} p_x(x_i)\, \Delta_i\, (x_i - x_{i,l})^2, \tag{12} $$

where $x_{i,l}$ and $x_i$ are given by eqs. (5) and (6), respectively,

$$ \Delta_i = \frac{2^{1-N}}{f'(x_i)} \tag{13} $$

is the quantization interval associated with $x_i$, $p_x(x_i)$ is the PDF
of the input sequence, assumed constant over the interval $\Delta_i$, and
$f'(x_i)$ is the derivative of f(x) at x = $x_i$.

This assumption is valid when there are many quantization levels, as
indeed there are for the 8-bit μ-law PCM encoding considered in this
paper. In addition to our 256 quantization levels, we have normalized
the ranges of x and y to −1, +1, and the compressed signal y is
uniformly quantized.®? Writing

$$ a_l(x_i) = (x_i - x_{i,l})^2, \tag{14} $$

and observing that the probability of x being located in the
quantization interval $\Delta_i$ is

$$ p_i = p_x(x_i)\, \Delta_i \tag{15} $$

allows us® to express the A-factors as

$$ A_l = \sum_{i=0}^{2^N - 1} p_i\, a_l(x_i). \tag{16} $$

Thus, the digital noise power is from eqs. (11) and (16):

$$ \varepsilon^2 = \sum_{l=1}^{N} \sum_{i=0}^{2^N - 1} p_l\, p_i\, a_l(x_i). \tag{17} $$

The probability $p_l$ of the error sequence $e_l$ occurring depends upon
the modulation scheme employed, e.g., CPSK, NCFSK, whether diversity
reception is used, the channel type, the channel s/n, and the
method of weighting the bits in the PCM words. We will describe
three weighting schemes, which for brevity we will designate Systems
1, 2, and 3.


3.1 Weighted PCM System 1 


As a means of introducing weighted PCM we will consider binary
amplitude modulation. For this situation each unweighted bit $b_l$ in
every PCM word has its magnitude multiplied by $\sqrt{\phi_l}$, where
l = 1, 2, ..., N refers to the first, second, ..., LSB, respectively,
and $\{\phi_l\}_{l=1}^{N}$ is the weighting profile for the PCM word,
generated as l is varied from 1 to N. For a particular bit, the lth,
say, $\phi_l$ is also referred to as its weight. Observe that in this
weighting scheme, each of the $2^N$ words has identical weighting
profiles. The N weighting samples $\sqrt{\phi_l}$, l = 1, 2, ..., N, are
stored in a circulating shift register. As each PCM bit $b_l$,
l = 1, 2, ..., N, is generated, the appropriate value of $\sqrt{\phi_l}$ is
removed from the shift register and is multiplied by $b_l$. Thus the
weighted bit applied to the transmission terminal equipment is

$$ B_{1,l} = \sqrt{\phi_l}\, b_l; \quad b_l = \pm 1, \tag{18} $$

where the subscript 1 in $B_{1,l}$ signifies System 1. The receiver is
identical to that of unweighted PCM, and the received bits are
regenerated by observing at a sampling instant if

$$ \hat{B}_{1,l} \ge 0; \ \text{bit of logical 1} \qquad \hat{B}_{1,l} < 0; \ \text{bit of logical 0}, \tag{19} $$

where $\hat{B}_{1,l}$ is the received value of $B_{1,l}$.

For the general modulation case, let the energy of the lth bit be

$$ E_l = \phi_l E, \tag{20} $$

where E is the average energy per bit of the PCM signal,

$$ E = \frac{1}{N} \sum_{l=1}^{N} E_l, \tag{21} $$

and by definition $\phi_l$ is the energy weight assigned to the lth bit.
The constraint on the weights in System 1 is, with the aid of eqs. (20)
and (21),

$$ N = \sum_{l=1}^{N} \phi_l. \tag{22} $$

Thus, from eq. (20), the channel s/n for the lth bit is

$$ \Gamma_l = \phi_l \Gamma; \quad \text{System 1}, \tag{23} $$

where Γ is the average channel s/n, viz:

$$ \Gamma = \frac{E}{N_0}, \tag{24} $$

and $N_0$ is the one-sided spectral density function of the additive
white Gaussian noise.
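The System 1 constraint and the resulting per-bit channel s/n can be illustrated with a small sketch. The raw profile used below is purely hypothetical and is not an optimized profile from the paper; the function names are likewise assumptions. The code scales any positive profile so that the weights sum to N, forms the weighted bit amplitudes, and computes the per-bit s/n.

```python
import math

def normalize_profile(raw):
    """Scale positive raw weights so that they sum to N, the number of bits."""
    n = len(raw)
    s = sum(raw)
    return [n * r / s for r in raw]

def weighted_bits(bits, phi):
    """Weighted amplitudes sqrt(phi_l) * b_l for bits b_l in {+1, -1}."""
    return [math.sqrt(w) * b for w, b in zip(phi, bits)]

def bit_snr(phi, gamma):
    """Per-bit channel s/n phi_l * gamma for average channel s/n gamma."""
    return [w * gamma for w in phi]

if __name__ == "__main__":
    raw = [8, 7, 6, 5, 4, 3, 2, 1]               # hypothetical profile, MSB first
    phi = normalize_profile(raw)
    bits = [1, -1, 1, 1, -1, 1, -1, -1]
    print(phi)
    print(weighted_bits(bits, phi))
    print(bit_snr(phi, 10.0))
```

Because the weights sum to N, the average word energy and the mean of the per-bit s/n values are unchanged relative to unweighted PCM; only their distribution across the bits differs.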


3.2 Weighted PCM System 2 


System 1 deploys fixed weighting profiles, where the same profile is
used for every PCM word. This is a suboptimum weighting strategy.
Because the digital noise power depends on the value of the
quantized sample $x_i$, it appears desirable to have $2^N$ different
weighting profiles, one for each value of i. By this means we are able
to select the unique weighting profile that best suits the particular
PCM word to be transmitted. Thus, having generated a μ-law PCM word we
use the index i to address a Read-Only Memory (ROM) that provides us
with the optimum weighting profile $\{\phi_l(x_i)\}_{l=1}^{N}$ for that
particular word. Weighting then ensues and the weighted lth bit has an
amplitude

$$ B_{2,l} = \sqrt{\phi_l(x_i)}\, b_l; \quad b_l = \pm 1 \tag{25} $$

for binary amplitude modulation, where the input sample x is
quantized to $x_i$. The PCM decoder for this system, as with the other
two weighted PCM systems, is the same as that employed in conventional
PCM.

Because the weighting depends upon the value of the quantized level
$x_i$, we amend eq. (23) to

$$ \Gamma_l(x_i) = \phi_l(x_i)\, \Gamma; \quad \text{System 2}. \tag{26} $$

3.3 Weighted PCM System 3 


In System 2 each profile $\{\phi_l(x_i)\}_{l=1}^{N}$ is optimum for the
particular word associated with the quantized level $x_i$, with the
imposed constraint that the word energy for every PCM word is the
same. However, the digital noise power varies from word to word, and
this leads us to the notion of a subsequent modification to the
weighted bit $B_{2,l}$. Specifically, the amplitudes of the weighted
bits in the word associated with quantization level $x_i$ are

$$B_{3,l} = \sqrt{W_i}\, B_{2,l}; \quad l = 1, 2, \ldots, N, \tag{27}$$

where $W_i$ is a multiplicative factor to be determined. Adopting this
approach we arrange for the overall average word energy (and thus the
overall average bit energy), rather than the individual word energy,
to be constant. We will show that by combining the individual word
weighting $\phi_l(x_i)$ of System 2 with the multiplicative factor
$W_i$, a significant enhancement in overall s/n is achieved. In
Section 4.3 the constraints on the factor $W_i$ are specified.

The weighting strategy employed in System 3 results in a channel s/n
for the $l$th bit and the $i$th quantization level of

$$\Gamma_l(x_i) = \phi_l(x_i) W_i \Gamma; \quad \text{System 3}. \tag{28}$$


3.4 Determining the digital noise power 


In order to formulate the digital noise power of eq. (17) we commence
with an expression for probability $p_l$ that applies for a particular
modulation scheme. This probability is dependent upon the long-term
channel s/n, $\Gamma$, and we replace $\Gamma$ by one of our values of
$\Gamma_l$ given by either eq. (23), (26), or (28). Thus we obtain an
equation of $\epsilon_a^2$ as a function of our weighting factor,
namely, $\phi_l$, $\phi_l(x_i)$, or $\phi_l(x_i) W_i$. The optimum
weighting factor is then found that minimizes the digital noise power,
and thereby maximizes the overall s/n. This is the essence of our
approach, and in Section IV we present the derivation of the digital
noise power in detail.


3.5 Objective system performance criterion 


The probability of bit error is a poor measure of system performance,
as we discuss in Section 4.1. Instead we opt for overall s/n, given by

$$\text{s/n} = \frac{E\{x^2\}}{\epsilon^2}
= \frac{\sigma_x^2}{\epsilon_q^2 + \epsilon_c^2 + \epsilon_a^2}, \tag{29}$$

where $E\{x^2\}$ or $\sigma_x^2$ is the mean signal power, and
$\epsilon_q^2$, $\epsilon_c^2$, and $\epsilon_a^2$ are the
quantization, clipping, and transmission error noise power components
of $\epsilon^2$, respectively. The derivation of eq. (29) is given in
Ref. 3. By assuming that $N$ is large, e.g., $N = 8$, the noise power
generated in the $\mu$-law PCM encoder is
$\epsilon_q^2 + \epsilon_c^2$. The quantization noise power
$\epsilon_q^2$ is produced in the quantization process when the input
variable $x$ is within the range of the quantizer, namely, $-1$ to
$+1$. Clipping noise $\epsilon_c^2$ is produced when $x$ exceeds the
range of the quantizer as the recovered signal amplitudes are
truncated. The channel noise power $\epsilon_a^2$ is added to the
encoder noise power to yield the total noise power.
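The performance measure of eq. (29) can be evaluated as in the short
Python sketch below. The function name and the numerical powers are
hypothetical, chosen only to show how a transmission error term lowers
the overall s/n.

```python
import math

def overall_snr_db(signal_power, eps_q2, eps_c2, eps_a2):
    """Eq. (29): signal power over the sum of the three noise powers, in dB."""
    total_noise = eps_q2 + eps_c2 + eps_a2
    return 10.0 * math.log10(signal_power / total_noise)

# Hypothetical powers (not from the paper), relative to unit signal power.
error_free = overall_snr_db(1.0, 8e-4, 2e-4, 0.0)    # encoder noise only
with_errors = overall_snr_db(1.0, 8e-4, 2e-4, 9e-3)  # plus digital noise
```

In this made-up case a digital noise power nine times the encoder
noise lowers the overall s/n by 10 dB.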


IV. WEIGHTED PCM FOR FADING CHANNELS 


The weighting profiles for PCM when the transmission is over 
channels subjected to Rayleigh fading will now be determined. Unless 
otherwise stated, it will be assumed that NCFSK modulation is em- 
ployed. 


4.1 Weighted PCM System 1 


Let us weight the bits in each word using the same profile for a 
given channel s/n. The bits are scrambled prior to transmission and 
on descrambling at the receiver the burst errors on the channel 
manifest as independent random errors. For NCFSK modulation the 
average bit error probability for M-fold diversity with Ideal Maximal 
Ratio Combining (IMRC) ‘is® 


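$$P = \frac{2^{M-1}}{(2 + \Gamma)^M}, \tag{30}$$

where $\Gamma$ is the average per-branch channel s/n. From eqs. (23)
and (30) we identify $p_l$ of eq. (11) as

$$p_l = \frac{2^{M-1}}{(2 + \phi_l \Gamma)^M}, \tag{31}$$

and hence from eqs. (16) and (17) the digital noise power becomes

$$\epsilon_a^2 = \sum_{i=0}^{2^N-1} p_i \sum_{l=1}^{N}
\frac{2^{M-1} a_l(x_i)}{(2 + \phi_l \Gamma)^M}
= \sum_{l=1}^{N} \frac{2^{M-1} A_l}{(2 + \phi_l \Gamma)^M}, \tag{32}$$

where the constraint of eq. (22) is applicable. In Appendix A,
$\epsilon_a^2$ is shown to be minimized if the weights

$$\phi_l = \frac{N (A_l)^{1/(M+1)}}{\sum_{k=1}^{N} (A_k)^{1/(M+1)}}
+ \frac{2}{\Gamma} \left[
\frac{N (A_l)^{1/(M+1)}}{\sum_{k=1}^{N} (A_k)^{1/(M+1)}} - 1
\right] \tag{33}$$
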

are employed. Observe that when no diversity is used $M$ is unity, and
for this case the average bit error probability
$P = 1/(2 + \Gamma)$ is upper bounded by $P < 1/\Gamma$. This bound is
tight for channel s/n values of interest, and calculating the weight
profile $\phi_l$ using the upper bound yields the first term in
eq. (33), which is independent of $\Gamma$. The approximate weighting
profile that assumes the upper bound of $P$ for $M = 1$ also gives the
optimum weights when $\Gamma$ approaches infinity. Substituting
$\phi_l$ of eq. (33) into eq. (32) yields the digital noise power

$$\epsilon_a^2 = \frac{2^{M-1}}{(2 + \Gamma)^M}
\left( \frac{1}{N} \right)^{M}
\left[ \sum_{l=1}^{N} (A_l)^{1/(M+1)} \right]^{M+1}. \tag{34}$$
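The relationship between eqs. (32), (33), and (34) can be checked
numerically. The Python sketch below assumes the weight formula has
the form "asymptotic term plus a 2/Γ correction" and uses made-up
A-factors; the function names are illustrative, not the paper's.

```python
import math

def optimum_weights(A, gamma, M=1):
    """Eq. (33): optimum System 1 weights for NCFSK with M-fold IMRC."""
    n = len(A)
    roots = [a ** (1.0 / (M + 1)) for a in A]
    total = sum(roots)
    asymptotic = [n * r / total for r in roots]       # first term of eq. (33)
    return [x + (2.0 / gamma) * (x - 1.0) for x in asymptotic]

def digital_noise(A, phi, gamma, M=1):
    """Eq. (32): digital noise power for an arbitrary weight profile."""
    return sum(2 ** (M - 1) * a / (2.0 + p * gamma) ** M
               for a, p in zip(A, phi))

def minimum_noise(A, gamma, M=1):
    """Eq. (34): closed-form noise power when eq. (33) weights are used."""
    n = len(A)
    total = sum(a ** (1.0 / (M + 1)) for a in A)
    return (2 ** (M - 1) / (2.0 + gamma) ** M
            * (1.0 / n) ** M * total ** (M + 1))

# Hypothetical A-factors for a 4-bit word (not taken from the paper).
A = [0.5, 0.25, 0.1, 0.02]
gamma, M = 1000.0, 2            # 30-dB channel s/n, second-order diversity
phi = optimum_weights(A, gamma, M)
weighted = digital_noise(A, phi, gamma, M)
unweighted = digital_noise(A, [1.0] * len(A), gamma, M)
```

With these inputs the weights still satisfy eq. (22), the weighted
noise agrees with the closed form, and the unweighted profile gives a
larger digital noise power.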


For weights other than those of eq. (33), eq. (34) is a lower bound on
the digital noise power. The average bit error probability from
eq. (31) is

$$P_w = \frac{1}{N} \sum_{l=1}^{N}
\frac{2^{M-1}}{(2 + \Gamma \phi_l)^M}. \tag{35}$$


The large contributions to $P_w$ derive from the least significant
bits because their energy is less than in the unweighted case. The
average bit error probability $P_w$ for the weighted system does not
equal $P$, and in general it is not a useful design parameter, since
the effect of different bit errors on the recovered signal is
radically different. Therefore, we avoid comparing the various
weighted PCM systems on the basis of average bit error probabilities.
Rather we make our system comparisons at the same average channel s/n,
and use as our performance measure the overall s/n given by eq. (29).
The overall s/n is a good measure of the quality of the recovered
signal, and although for a given channel s/n the weighting may
increase $P_w$, it nearly always reduces $\epsilon_a^2$, and hence
increases s/n.
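The point that weighting can raise the average bit error probability
while lowering the digital noise power can be seen in a small
numerical sketch. It assumes eqs. (30), (31), (33), and (35) in the
forms given above, no diversity (M = 1), and hypothetical A-factors.

```python
def optimum_weights(A, gamma, M=1):
    """Eq. (33), assumed form: asymptotic weight plus a 2/gamma correction."""
    n = len(A)
    roots = [a ** (1.0 / (M + 1)) for a in A]
    total = sum(roots)
    return [n * r / total + (2.0 / gamma) * (n * r / total - 1.0)
            for r in roots]

def bit_error_probs(phi, gamma, M=1):
    """Eq. (31): per-bit error probability for NCFSK with IMRC."""
    return [2 ** (M - 1) / (2.0 + p * gamma) ** M for p in phi]

# Hypothetical A-factors (not from the paper); channel s/n of 20 dB.
A = [0.4, 0.2, 0.05, 0.01]
gamma = 100.0

p_weighted = bit_error_probs(optimum_weights(A, gamma), gamma)
p_unweighted = bit_error_probs([1.0] * len(A), gamma)

avg_ber_weighted = sum(p_weighted) / len(A)       # eq. (35)
avg_ber_unweighted = sum(p_unweighted) / len(A)   # equals P of eq. (30)
noise_weighted = sum(a * p for a, p in zip(A, p_weighted))
noise_unweighted = sum(a * p for a, p in zip(A, p_unweighted))
```

Here the weighted system has the higher average bit error probability
and the lower digital noise power, which is why the comparison is made
on overall s/n.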

We point out that it is not essential to have $N$ different bit
weights to obtain significant reductions in $\epsilon_a^2$ compared to
unweighted PCM. Suboptimum weighting schemes with fewer weights than
$N$ are discussed in Ref. 2, but we will confine ourselves here to the
optimum weighting profile condition.

On rare occasions the weighting profile may include negative weights,
for example, when the channel s/n, $\Gamma$, is low in eq. (33). The
occurrence of negative weights is primarily due to the oversimplified
optimization procedures we adopted. Ideally the optimization should
always include the constraint $\phi_l \ge 0$, $l = 1, 2, \ldots, N$,
even when we choose to ignore the effect of double errors, which
become more probable for low values of $\Gamma$ and small weights.
However, for high-channel signal-to-noise ratios the simplified
optimization procedure employed here is quite sufficient. Negative
weights can be avoided without this constraint if we elect to discard
the second term in eq. (33), i.e., we employ the optimum weights
appropriate for high-channel s/n. Of course, the low-channel s/n's
that result in a weight becoming negative are of no practical value,
as the overall s/n is then unacceptably low. However, the unrestricted
optimization we have performed always provides upper bounds on the
gains in s/n that apply to the different weighting schemes, bounds
that are reasonable estimates of the gains in s/n that can be
practically expected. Thus, in our simulations we employed the
asymptotic weight, i.e., the first term in eq. (33), namely,

$$\phi_l = \frac{N (A_l)^{1/(M+1)}}{\sum_{k=1}^{N} (A_k)^{1/(M+1)}}. \tag{36}$$


This weight was also employed in our simulations using CPSK mod- 
ulation. 
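To see when a negative weight can arise, the sketch below assumes that
eq. (33) can be written as the asymptotic weight of eq. (36) plus
(2/Γ) times (asymptotic weight − 1). Under that assumption a weight
turns negative once Γ falls below 2(1 − φ)/φ, where φ is the
asymptotic weight. The function names are illustrative, and 0.0429 is
the LSB entry listed in Table I for an input power of −17 dB.

```python
import math

def full_weight(phi_asymptotic, gamma):
    """Eq. (33) written as asymptotic weight plus its 2/gamma correction."""
    return phi_asymptotic + (2.0 / gamma) * (phi_asymptotic - 1.0)

def negative_weight_threshold_db(phi_asymptotic):
    """Channel s/n in dB below which full_weight() becomes negative.

    Returns None when the asymptotic weight is >= 1, because the
    correction term is then non-negative for every channel s/n.
    """
    if phi_asymptotic >= 1.0:
        return None
    return 10.0 * math.log10(2.0 * (1.0 - phi_asymptotic) / phi_asymptotic)

lsb_weight = 0.0429                       # Table I, -17 dB, bit 8
threshold = negative_weight_threshold_db(lsb_weight)
low = full_weight(lsb_weight, 10.0)       # 10-dB channel s/n
high = full_weight(lsb_weight, 1000.0)    # 30-dB channel s/n
```

For this bit the changeover falls at a channel s/n of roughly 16.5 dB.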


4.2 Weighted PCM System 2 


The individual word weighting profiles are dependent upon the
$a_l(x_i)$ terms in the A-factors [see eqs. (14) and (16)]. These
$a_l(x_i)$ functions enable us to know how the effect of an error in
the $l$th bit varies with the input level $x_i$. The reader is
referred to Ref. 7 for a detailed account of the variation of
$a_l(x_i)$ as a function of $x_i$ for $l = 1, 2, \ldots, 8$, for 8-bit
$\mu$-law PCM, $\mu = 255$.

We now consider the digital noise power that occurs in this weighting
scheme. First, the value of $\Gamma_l(x_i)$ given by eq. (26) replaces
$\Gamma$ in eq. (30). The expression for $p_l$ so formulated is then
substituted into eq. (17) to give the digital noise power of

$$\epsilon_a^2 = \sum_{i=0}^{2^N-1} p_i \sum_{l=1}^{N}
\frac{2^{M-1} a_l(x_i)}{(2 + \phi_l(x_i) \Gamma)^M}. \tag{37}$$

Minimizing $\epsilon_a^2$ with respect to $\phi_l(x_i)$ yields the
optimum weights

$$\phi_l(x_i) =
\frac{N [a_l(x_i)]^{1/(M+1)}}{\sum_{n=1}^{N} [a_n(x_i)]^{1/(M+1)}}
+ \frac{2}{\Gamma} \left\{
\frac{N [a_l(x_i)]^{1/(M+1)}}{\sum_{n=1}^{N} [a_n(x_i)]^{1/(M+1)}} - 1
\right\}. \tag{38}$$

If we use the optimum weights specified by eq. (38), the digital noise
power is given by

$$\epsilon_a^2 = \frac{2^{M-1}}{(2 + \Gamma)^M}
\left( \frac{1}{N} \right)^{M}
\sum_{i=0}^{2^N-1} p_i
\left[ \sum_{l=1}^{N} [a_l(x_i)]^{1/(M+1)} \right]^{M+1}. \tag{39}$$

For high-channel s/n only the first term in eq. (38) need be
considered. The variation of $\phi_l(x_i)$ as a function of quantized
level $x_i$ for 8-bit $\mu$-law PCM, $\mu = 255$, is displayed in
Fig. 2 for each bit in the PCM word. The discontinuities in
$\phi_l(x_i)$ derive from the discontinuities in
$\{a_n(x_i)\}_{n=1}^{N}$. In the case of $\phi_1(x_i)$, where
$a_1(x_i)$ is a monotonic function, the summation of $a_n(x_i)$ in the
denominator of $\phi_1(x_i)$ is responsible for the jumps observed in
$\phi_1(x_i)$ in Fig. 2a. When diversity is applied, $M = 2$, the word
weighting profiles $\phi_l(x_i)$ for System 2 are altered, although
they have similar shapes to those shown in Fig. 2.

The average bit error probability is

$$P_w = \sum_{i=0}^{2^N-1} p_i \frac{1}{N} \sum_{l=1}^{N}
\frac{2^{M-1}}{[2 + \phi_l(x_i) \Gamma]^M}. \tag{40}$$


4.3 Weighted PCM System 3 
Let us commence our investigation into this form of weighting by
introducing the constraint

$$\sum_{i=0}^{2^N-1} p_i \frac{E_i}{N_0} = \frac{E}{N_0}, \tag{41}$$

where $p_i$ is given by eq. (15). We introduce the word weighting
factor $W_i$ by ensuring that the channel s/n for $x_i$ is

$$\frac{E_i}{N_0} = W_i \frac{E}{N_0}; \quad
i = 0, 1, 2, \ldots, 2^N - 1. \tag{42}$$

From eqs. (41) and (42) we formulate our imposed constraint on the
word weighting factor $W_i$, viz:

$$\sum_{i=0}^{2^N-1} p_i W_i = 1. \tag{43}$$

The individual bit weights in the PCM words are given by eq. (38), and
the previous constraint of eq. (22) with $\phi_l = \phi_l(x_i)$ still
applies. We observe that although the word energy is no longer
constant, the average PCM word energy $E$ is unchanged by this
subsequent word weighting of $W_i$.

Fig. 2—The variation of the word weighting profiles $\phi_l(x_i)$ as a
function of $x_i$ for 8-bit $\mu$-law PCM over the Rayleigh fading
channel, $M = 1$, input signal level of −17 dB. The subfigures (a),
(b), ..., (h) relate to $l = 1, 2, \ldots, 8$, respectively.

The minimization of the digital noise power is divided into two steps.
The optimum bit weight profiles, $\phi_l(x_i)$, are basically the same
as in the previous section, with the exception that the word weights
are now varied with the "word s/n", i.e., $E_i/N_0$. The average
digital noise power when the weighting set $\{W_i\}$ is included is,
from eqs. (17), (28), and (30),

$$\epsilon_a^2 = \sum_{i=0}^{2^N-1} p_i \sum_{l=1}^{N}
\frac{2^{M-1} a_l(x_i)}{(2 + \phi_l(x_i) W_i \Gamma)^M}. \tag{44}$$

The optimum weights $\phi_l(x_i)$ given by eq. (38) enable
$\epsilon_a^2$ of eq. (44) to be expressed as

$$\epsilon_a^2 = \sum_{i=0}^{2^N-1} p_i
\frac{2^{M-1}}{(2 + W_i \Gamma)^M}\, \beta(x_i), \tag{45}$$

where

$$\beta(x_i) = \left( \frac{1}{N} \right)^{M}
\left[ \sum_{l=1}^{N} [a_l(x_i)]^{1/(M+1)} \right]^{M+1}. \tag{46}$$

The function $\beta(x_i)$ depends upon the magnitude of the quantized
level and upon $a_l(x_i)$, whose discontinuities cause $\beta(x_i)$ to
have sharp jumps as $x_i$ changes segments.

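The optimum word weights $W_i$ are determined by minimizing
$\epsilon_a^2$ with respect to $W_i$, employing the constraint of
eq. (43). By this method the optimum word weights are

$$W_i = \frac{[\beta(x_i)]^{1/(M+1)}}
{\sum_{k=0}^{2^N-1} p_k [\beta(x_k)]^{1/(M+1)}}
+ \frac{2}{\Gamma} \left\{
\frac{[\beta(x_i)]^{1/(M+1)}}
{\sum_{k=0}^{2^N-1} p_k [\beta(x_k)]^{1/(M+1)}} - 1
\right\}. \tag{47}$$

Substituting $W_i$ into eq. (45) yields the digital noise power

$$\epsilon_a^2 = \frac{2^{M-1}}{(2 + \Gamma)^M}
\left[ \sum_{i=0}^{2^N-1} p_i [\beta(x_i)]^{1/(M+1)} \right]^{M+1}. \tag{48}$$

The average bit error probability is

$$P_w = \sum_{i=0}^{2^N-1} p_i \frac{1}{N} \sum_{l=1}^{N}
\frac{2^{M-1}}{(2 + \phi_l(x_i) W_i \Gamma)^M}. \tag{49}$$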
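The two-step optimization for System 3 can be sketched numerically as
follows. The code assumes eqs. (45) to (48) in the forms given above
and uses a hypothetical three-level source with two bits per word; the
function names and all numerical values are illustrative only.

```python
import math

def beta(a_row, M=1):
    """Eq. (46): per-level factor left after optimum bit weighting."""
    n = len(a_row)
    total = sum(a ** (1.0 / (M + 1)) for a in a_row)
    return (1.0 / n) ** M * total ** (M + 1)

def word_weights(p, betas, gamma, M=1):
    """Eq. (47): optimum word weights under the constraint of eq. (43)."""
    roots = [b ** (1.0 / (M + 1)) for b in betas]
    mean_root = sum(pi * r for pi, r in zip(p, roots))
    return [r / mean_root + (2.0 / gamma) * (r / mean_root - 1.0)
            for r in roots]

def noise_system3(p, betas, W, gamma, M=1):
    """Eq. (45): digital noise power for a given set of word weights."""
    return sum(pi * 2 ** (M - 1) * b / (2.0 + w * gamma) ** M
               for pi, b, w in zip(p, betas, W))

# Hypothetical source: level probabilities p_i and factors a_l(x_i).
p = [0.5, 0.3, 0.2]
a = [[0.4, 0.1], [0.2, 0.05], [0.05, 0.01]]
gamma, M = 100.0, 1

betas = [beta(row, M) for row in a]
W = word_weights(p, betas, gamma, M)
noise_w = noise_system3(p, betas, W, gamma, M)
noise_equal = noise_system3(p, betas, [1.0] * len(p), gamma, M)  # System 2
closed_form = (2 ** (M - 1) / (2.0 + gamma) ** M
               * sum(pi * b ** (1.0 / (M + 1))
                     for pi, b in zip(p, betas)) ** (M + 1))   # eq. (48)
```

Setting every word weight to unity recovers System 2, so the
difference between `noise_equal` and `noise_w` is the reduction
attributable to the word weighting alone in this toy case.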
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4.4 Weighting for ideal selection combining diversity and NCFSK 

When Ideal Selection Combining (ISC) diversity is employed, the 
average bit error probability of an unweighted PCM system can be 
shown to be&® 


$$P = \frac{1}{2} \sum_{k=0}^{M} \binom{M}{k}
\frac{(-1)^k}{1 + 2k/\Gamma}, \tag{50}$$


where $M$ is the order of the diversity. The channel s/n $\Gamma$ is
changed according to the method of weighting, as described in
Sections 3.1 through 3.3. For Systems 1, 2, and 3, $\Gamma$ is
replaced by $\phi_l \Gamma$, $\phi_l(x_i) \Gamma$, and
$\phi_l(x_i) W_i \Gamma$, respectively. Satisfactory values of the
weights are the asymptotic weights derived for the different weighting
procedures for ideal maximal ratio combining diversity. Thus, the
weights given by eq. (36) are used for System 1; $\phi_l(x_i)$ of
eq. (38) is used when $\Gamma$ tends to infinity [i.e., the asymptotic
values of $\phi_l(x_i)$, for System 2], while the same $\phi_l(x_i)$
used for System 2 is also used for System 3, together with the
asymptotic value of $W_i$ [see eq. (47)]. As before, the values of
$p_l$ so determined are substituted into eq. (17) to give the digital
noise power for the three systems.


4.5 Weighting for CPSK modulation 


When Ideal Maximal Ratio Combining (IMRC) diversity is em- 
ployed with CPSK, the average bit error probability is®® 
$$P = \frac{1}{2} \left[ 1 - \frac{1}{\sqrt{1 + 1/\Gamma}}
\sum_{k=1}^{M} \binom{2k-2}{k-1}
\left( \frac{1}{4 (1 + \Gamma)} \right)^{k-1} \right]. \tag{51}$$


The value of $\Gamma$ in the above equation is changed according to
eq. (23), (26), or (28), depending on whether System 1, 2, or 3 is
employed. We employed the asymptotic weights derived for NCFSK and
IMRC diversity for this case of CPSK modulation. The probability $p_l$
computed by this means is substituted into eq. (17) to yield the
digital noise power.

For the case of ISC diversity and CPSK modulation the average bit 
error probability is®® 


$$P = \frac{1}{2} \sum_{k=0}^{M} (-1)^k \binom{M}{k}
\frac{1}{\sqrt{1 + k/\Gamma}}. \tag{52}$$

We convert $P$ into $p_l$ as before by replacing $\Gamma$ with the
appropriate value given by eq. (23), (26), or (28), and then use the
asymptotic weights derived for the case of NCFSK and IMRC diversity.

4.6 Unweighted PCM 


For conventional PCM, i.e., where weighting is not employed, and 
bit scrambling is provided prior to transmission, the modulation being 
NCFSK, then the digital noise power can be shown to be’ 


eet 
= x Po (2+T)" 63) 


for IMRC diversity, where 
Ww N -j ~~ 
a Sw 1 (—1)""; =1,2,---,N, 54 
2 @ : i (1% w (54) 


and S,, is the sum of the A-factors associated with error sequences 
containing w ones. When ISC diversity is employed, the digital noise 
power is’ 


w 


2 as j 1 
a= z Lis = (-—1) e ; Vea ; (55) 


Tr’? 


Equations (53) and (55) are included as bench marks against which 
the digital noise power of weighted PCM can be measured. 


V. WEIGHTED PCM FOR THE GAUSSIAN CHANNEL 


For this channel there is no need for diversity, nor bit scrambling 
prior to transmission, as the bit errors are independent in nature. 


5.1 Weighted PCM System 1 


For NCFSK modulation and unweighted PCM the average bit error
probability is

$$P = \frac{1}{2} e^{-E/(2N_0)}, \tag{56}$$

and when bit weighting is applied such that the $l$th bit,
$l = 1, 2, \ldots, N$, has its amplitude individually adjusted, the
expression for $P$ is modified, with the aid of eq. (23), to

$$p_l = \frac{1}{2} e^{-\phi_l E/(2N_0)}. \tag{57}$$

Substituting $p_l$ into eq. (17) gives the digital noise power
$\epsilon_a^2$. In Appendix B we show that the optimum weights that
minimize this digital noise power are

$$\phi_l = 1 + \frac{\ln (A_l / A_0)}{E/(2N_0)}; \quad
l = 1, 2, \ldots, N, \tag{58}$$

where

$$A_0 = \left( \prod_{l=1}^{N} A_l \right)^{1/N}. \tag{59}$$

The optimum weights $\phi_l$ are thus dependent on the channel s/n,
namely $E/N_0$. Substituting the optimum weights $\phi_l$ into
eq. (17) yields

$$\epsilon_a^2 = N A_0 P, \tag{60}$$


where P is the average bit error probability given by eq. (56). 
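The Gaussian-channel result can be checked with a short numerical
sketch. It assumes eqs. (56) to (60) in the forms given above, with
the digital noise power taken as the sum of $A_l p_l$ over the bits,
and uses hypothetical A-factors; the function names are illustrative.

```python
import math

def gaussian_weights(A, e_over_n0):
    """Eq. (58): phi_l = 1 + ln(A_l / A_0) / (E / 2N0)."""
    n = len(A)
    ln_a0 = sum(math.log(a) for a in A) / n     # ln of eq. (59)
    return [1.0 + (math.log(a) - ln_a0) / (e_over_n0 / 2.0) for a in A]

def gaussian_noise(A, phi, e_over_n0):
    """Sum of A_l * p_l with p_l from eq. (57)."""
    return sum(a * 0.5 * math.exp(-p * e_over_n0 / 2.0)
               for a, p in zip(A, phi))

# Hypothetical A-factors (not from the paper); E/N0 = 20, i.e. 13 dB.
A = [0.4, 0.2, 0.05, 0.01]
e_over_n0 = 20.0

phi = gaussian_weights(A, e_over_n0)
a0 = math.exp(sum(math.log(a) for a in A) / len(A))   # geometric mean
P = 0.5 * math.exp(-e_over_n0 / 2.0)                  # eq. (56)
weighted = gaussian_noise(A, phi, e_over_n0)
unweighted = gaussian_noise(A, [1.0] * len(A), e_over_n0)
```

The unweighted noise equals $P$ times the sum of the A-factors, so the
gain in this sketch is the ratio of their arithmetic mean to their
geometric mean.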
The average bit error probability for the weighted scheme is, from
eq. (57),

$$P_w = \frac{1}{N} \sum_{l=1}^{N}
\frac{1}{2} e^{-\phi_l E/(2N_0)}. \tag{61}$$

When weighting is applied, and the weighting is optimum at each
channel s/n, the average bit error probability is determined by
substituting $\phi_l$ from eq. (58) into eq. (61), viz:

$$P_w = \frac{P}{N} \sum_{l=1}^{N} \frac{A_0}{A_l}, \tag{62}$$

where $P$ is the average bit error probability for NCFSK when no
weighting is used [see eq. (56)].

5.2 Weighted PCM System 2 


As the individual word weighting profile $\phi_l$ depends in this
weighting scheme on the value of $x_i$, we express $\phi_l$ as
$\phi_l(x_i)$, $l = 1, 2, \ldots, N$, for each of the $2^N$ values of
$x_i$. In Appendix C the weighting profile is shown to be

$$\phi_l(x_i) = 1 + \frac{\ln [a_l(x_i) / a_0(x_i)]}{E/(2N_0)}; \quad
l = 1, 2, \ldots, N; \quad i = 0, 1, 2, \ldots, 2^N - 1, \tag{63}$$

where

$$a_0(x_i) = \left( \prod_{l=1}^{N} a_l(x_i) \right)^{1/N}. \tag{64}$$

Thus we have the most significant bit (MSB), next MSB, ..., LSB,
weighted by $\phi_1(x_i), \phi_2(x_i), \ldots, \phi_N(x_i)$,
respectively, for quantized level $x_i$, and these word weighting
profiles are available for the $2^N$ values of $i$, i.e., the $2^N$
different PCM words. When these optimum weights $\phi_l(x_i)$ are
employed, the average digital noise power is, from eqs. (17) and (63),

$$\epsilon_a^2 = P \sum_{i=0}^{2^N-1} p_i N a_0(x_i). \tag{65}$$

The average bit error probability for the optimum individual weighting
profiles is, with the aid of eqs. (61) and (63),

$$P_w = P \sum_{i=0}^{2^N-1} p_i \frac{1}{N}
\sum_{l=1}^{N} \frac{a_0(x_i)}{a_l(x_i)}, \tag{66}$$

where $P$ is the bit error probability for NCFSK.



5.3 Weighted PCM System 3 


The average digital noise power for System 3 is, from eqs. (17), (28),
and (56),

$$\epsilon_a^2 = \sum_{i=0}^{2^N-1} p_i N a_0(x_i)\,
\frac{1}{2} e^{-W_i E/(2N_0)}. \tag{67}$$

The word weighting factors are optimized such that $\epsilon_a^2$ is
minimized. This minimization is performed in Appendix D, from which

$$W_i = 1 + \frac{2N_0}{E} \left\{ \ln [N a_0(x_i)]
- \sum_{k=0}^{2^N-1} p_k \ln [N a_0(x_k)] \right\}; \quad
i = 0, 1, 2, \ldots, 2^N - 1. \tag{68}$$


The final optimum bit weight is found by multiplying W; with ¢;(x;) 
of eq. (63), where EF is replaced by E;/W;. When this is performed the 
minimum digital noise power becomes 


$$\epsilon_a^2 = P \exp \left\{ \sum_{i=0}^{2^N-1}
p_i \ln [N a_0(x_i)] \right\}. \tag{69}$$
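A numerical check of the word weighting for the Gaussian channel is
sketched below. It assumes eqs. (67) to (69) in the forms given above;
the level probabilities and the $a_0(x_i)$ values are hypothetical and
the function names are illustrative.

```python
import math

def gaussian_word_weights(p, a0, n_bits, e_over_n0):
    """Eq. (68): word weights from ln[N a_0(x_i)] and its average."""
    logs = [math.log(n_bits * a) for a in a0]
    mean_log = sum(pi * li for pi, li in zip(p, logs))
    return [1.0 + (2.0 / e_over_n0) * (li - mean_log) for li in logs]

def noise_system3(p, a0, W, n_bits, e_over_n0):
    """Eq. (67): average digital noise power for given word weights."""
    return sum(pi * n_bits * a * 0.5 * math.exp(-w * e_over_n0 / 2.0)
               for pi, a, w in zip(p, a0, W))

# Hypothetical three-level source (not from the paper), N = 8 bits.
p = [0.5, 0.3, 0.2]
a0 = [0.2, 0.1, 0.02]
n_bits, e_over_n0 = 8, 20.0

W = gaussian_word_weights(p, a0, n_bits, e_over_n0)
P = 0.5 * math.exp(-e_over_n0 / 2.0)                        # eq. (56)
closed_form = P * math.exp(sum(pi * math.log(n_bits * a)
                               for pi, a in zip(p, a0)))    # eq. (69)
noise_w = noise_system3(p, a0, W, n_bits, e_over_n0)
noise_equal = noise_system3(p, a0, [1.0] * len(p), n_bits, e_over_n0)
```

With all word weights equal to unity the expression reduces to
eq. (65), so `noise_equal` corresponds to System 2 for the same toy
source.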


The average bit error probability for this combined individual and
word-by-word weighting is

$$P_w = \sum_{i=0}^{2^N-1} p_i \left[ \frac{1}{N}
\sum_{l=1}^{N} \frac{a_0(x_i)}{a_l(x_i)} \right]
\frac{1}{2} e^{-W_i E/(2N_0)}. \tag{70}$$

Substituting $W_i$ from eq. (68) into eq. (70) yields

$$P_w = P \sum_{i=0}^{2^N-1} p_i\,
\frac{\prod_{k=0}^{2^N-1} [N a_0(x_k)]^{p_k}}{N a_0(x_i)}\,
\frac{1}{N} \sum_{l=1}^{N} \frac{a_0(x_i)}{a_l(x_i)}. \tag{71}$$

5.4 Unweighted PCM 
The digital noise power can be shown to be” 
N N 
eR i) chad 0 ice ad ia a Je 6 odd (72) 
w=1 w=1 


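and the value of $P$ depends upon the modulation used. For NCFSK

$$P = \frac{1}{2} e^{-\Gamma/2}, \tag{73}$$

and for CPSK

$$P = Q(\sqrt{2\Gamma}), \tag{74}$$

where $Q(x)$ is the error function

$$Q(x) = \frac{1}{\sqrt{2\pi}} \int_{x}^{\infty} e^{-p^2/2}\, dp. \tag{75}$$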


5.5 Peak power constraint 


The formulae for the previous weighting schemes have been derived 
without imposing a limit on the peak power. When such limitations 
are applied the optimization problem changes, and the digital noise 
power may increase. Thus our formulae provide upper bounds for the 
gains in overall s/n, or, alternatively, reductions in digital noise power, 
compared to unweighted PCM at the same channel s/n. 


VI. RESULTS 


In this section we present numerical and graphical interpretations 
of the formulae derived in Section IV. We also provide computer 
simulation results for speech signals that were subjected to our differ- 
ent weighting strategies and conveyed over simulated Rayleigh fading 
and Gaussian channels. The encoder was 8-bit $\mu$-law PCM,
$\mu = 255$, where the quantized levels were binary encoded using a
folded binary code.


6.1 Results for the Rayleigh fading channel 


The constant word weighting profile $\phi_l$ of eq. (36) for $M = 1$,
computed using $A_l$ of eq. (10), where $l = 1$ to 8, is presented in
Table I for input powers of −17 and −40 dB. The individual word
weighting profiles $\phi_l(x_i)$ of eq. (38) are displayed in Fig. 2
for $M = 1$. The word weighting factors $W_i$ of eq. (47) are
presented in Table II for input powers of −17 and −40 dB, and $M = 1$.

Table I—Constant word weighting profile $\{\phi_l\}_{l=1}^{8}$ for the
Rayleigh fading channel (asymptotic weights)

Input                             Bit Number l
Power       1       2       3       4       5       6       7       8

−17 dB   1.9432  2.5643  2.0376  0.7974  0.3553  0.1732  0.0860  0.0429
−40 dB   0.6156  6.0135  0.7787  0.3250  0.1445  0.0704  0.0349  0.0174

[Figure 3 plot: overall s/n in decibels versus channel s/n in
decibels, by weighting scheme and diversity (ISC, M = 2; IMRC, M = 2).]

Fig. 3—Rayleigh fading channel. Theoretical curves of overall s/n as a
function of channel s/n for NCFSK and an input power level of −17 dB.
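As a quick consistency check, the Table I profiles can be tested
against the constraint of eq. (22), which requires the eight weights
of each profile to sum to N = 8. The sketch below does this and also
converts each weight to a per-bit s/n offset in decibels, following
eq. (23). The dictionary layout and function name are illustrative
choices, not part of the paper.

```python
import math

# Asymptotic weights copied from Table I, keyed by input power in dB.
TABLE_I = {
    -17: [1.9432, 2.5643, 2.0376, 0.7974, 0.3553, 0.1732, 0.0860, 0.0429],
    -40: [0.6156, 6.0135, 0.7787, 0.3250, 0.1445, 0.0704, 0.0349, 0.0174],
}

def snr_offsets_db(profile):
    """Eq. (23) in decibels: bit l sees the channel s/n shifted by 10 log10(phi_l)."""
    return [10.0 * math.log10(phi) for phi in profile]

sums = {level: sum(profile) for level, profile in TABLE_I.items()}
offsets_40 = snr_offsets_db(TABLE_I[-40])
```

Both rows sum to 8 within the rounding of the tabulated values. At
−40 dB the second bit, the most significant magnitude bit, receives
close to 7.8 dB more than the average bit energy.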

The s/n given by eq. (29) is used as our objective performance
measure. In the absence of transmission errors $\epsilon_a^2$ is zero,
and the s/n becomes $\sigma_x^2/(\epsilon_q^2 + \epsilon_c^2)$. For an
8-bit $\mu$-law PCM codec, $\mu = 255$, the variation of
$\sigma_x^2/(\epsilon_q^2 + \epsilon_c^2)$ as a function of input
power $\sigma_x^2$ is determined
using well-established results.” Two input power levels of —17 and —40 
dB relative to the clipping level (+1) are considered, having a corre- 
spondence to voiced and unvoiced signal levels, respectively. Knowing 
$\sigma_x^2$ and $\sigma_x^2/(\epsilon_q^2 + \epsilon_c^2)$ enables
$(\epsilon_q^2 + \epsilon_c^2)$ to be found. The next task is to
compute the digital noise power $\epsilon_a^2$ given by eq. (17). We
consider the PDF of the source to be exponential,

$$p(x) = \frac{1}{\sqrt{2}\, \sigma_x}
\exp \left( - \frac{\sqrt{2}\, |x|}{\sigma_x} \right), \tag{76}$$

from which the $2^N$ values of $p_i$ are determined using the $2^N$
values of $x_i$. The values of $p_i$ and the $a_l(x_i)$ factors are
substituted in eq. (16) to yield the set of A-factors $A_l$.

[Figure 4 plot: overall s/n in decibels versus channel s/n in decibels
(0 to 60 dB), by weighting scheme and diversity (none; ISC, M = 2;
IMRC, M = 2).]

Fig. 4—Rayleigh fading channel. Theoretical curves of overall s/n as a
function of channel s/n for NCFSK and an input power level of −40 dB.

The digital noise power $\epsilon_a^2$ of unweighted $N$-bit
$\mu$-law PCM, $N = 8$, $\mu = 255$, is found using either eq. (53) or
(55). The variation of $\epsilon_a^2$ as a function of channel s/n is
next determined. For a given input power $\sigma_x^2$ the s/n depends
only on $\epsilon_a^2$, and hence $\Gamma$. Consequently, we are able
to obtain the graphs of theoretical s/n as a function of channel s/n
for unweighted PCM, as shown by curves a in Figs. 3, 4, 5, and 6,
where the curves in Figs. 3 and 5 and those in Figs. 4 and 6 apply for
input levels of −17 and −40 dB, respectively. These unweighted PCM
curves are included as references to which our weighted PCM curves can
be compared.

The optimum weights for high-channel s/n values for Systems 1, 2, and
3, i.e., asymptotic weights, are the first term in eqs. (33), (38),
and (47), respectively. The digital noise power for the various
schemes and conditions applicable to Figs. 3 to 6 is formulated with
the aid of eqs. (32), (37), and (44), where the asymptotic weights are
used.

Figure 3 shows the curves of theoretical s/n as a function of channel
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Table II—Word weight factors W; for the Rayleigh fading channel 
(asymptotic weights) 
Input Power = —17 dB 


0.1204 0.1258 0.138138 0.13872 0.1432 0.1496 0.1562 0.1632 0.1704 
0.1780 0.1859 0.1941 0.2028 0.2118 0.2212 0.2310 0.2410 0.2517 
0.2629 0.2746 0.2868 0.2995 0.3128 0.3268 0.3412 0.3564 0.3722 
0.3888 0.4060 0.4241 0.4429 0.4626 0.4753 0.4964 0.5185 0.5415 
0.5655 0.5907 0.6169 0.6443 0.6727 0.7026 0.7338 0.7664 0.8003 


0.8359 0.8729 0.9117 0.9507 0.9929 1.0370 1.0829 1.1310 1.1811 
1.2335 1.2882 1.3451 1.4047 1.4670 1.5320 1.5999 1.6708 1.7448 
1.8222 0.4804 0.5020 0.5245 0.5481 0.5725 0.5982 0.6250 0.6530 
0.6805 0.7110 0.7428 0.7760 0.8104 0.8467 0.8844 0.9239 0.9436 
0.9857 1.0297 1.0756 1.1232 1.1733 1.2256 1.2802 1.3337 1.3931 


1.4551 1.5199 1.5870 1.6576 1.7313 1.8083 1.2848 1.3422 1.4020 
1.4646 1.5289 1.5971 1.6682 1.7426 1.8100 1.8906 1.9747 2.0626 
2.1530 2.2488 2.3487 2.4532 2.4303 2.5385 2.6511 2.7690 2.8900 
3.0184 3.1523 3.2924 3.4170 3.5688 3.7270 3.8925 4.0622 4.2426 
4.4305 4.6272 


Input Power = —40 dB 


0.3664 0.3827 0.3996 0.4174 0.4359 0.4552 0.4754 0.4966 
0.5185 0.5416 0.5657 0.5908 0.6171 0.6445 0.6731 0.7031 
0.7333 0.7659 0.8000 0.8356 0.8727 0.9115 0.9520 0.9943 
1.0384 1.0845 1.1327 1.1830 1.2356 1.2905 1.3478 1.4077 
1.4464 1.5107 1.5778 1.6478 1.7210 1.7974 1.8772 1.9605 


2.0472 2.1381 2.2329 2.3320 2.4355 2.5435 2.6564 2.7742 
2.8932 3.0215 3.1555 3.2954 3.4415 3.5941 3.7535 3.9199 
4.0931 4.2745 4.4640 4.6619 4.8685 5.0843 5.3096 5.5449 
1.4618 1.5275 1.5961 1.6678 1.7421 1.8202 1.9018 1.9870 
2.0708 2.1635 2.2603 2.3613 2.4662 2.5764 2.6914 2.8116 


2.8715 2.9996 3.1334 3.2732 3.4181 3.5705 3.7295 3.8956 
4.0586 4.2393 4.4279 4.6250 4.8293 5.0442 5.2683 5.5026 
3.9097 4.0844 4.2664 4.4568 4.6526 4.8601 5.0764 5.3027 
5.5078 5.7532 6.0090 6.2766 6.5515 6.8431 7.1471 7.4560 
7.3956 7.7246 8.0675 8.4263 8.7943 9.1852 9.5926 10.0189 
10.3980 10.8599 11.3413 11.8449 12.3614 12.9102 13.4822 14.0806 


s/n for NCFSK; an input power level of −17 dB; no diversity, and
two-branch diversity, when IMRC and ISC techniques are used; and the
effect of employing the three weighting schemes. Curves b, c, and d
apply to weighted PCM Systems 1, 2, and 3, respectively, when
diversity reception is not used. Comparing these curves to the
unweighted PCM curve a, we observe that over the range of channel s/n
where these are parallel, the gain in s/n is 2.8, 4.5, and 6 dB,
respectively. When second-order diversity is employed, curves e to l
apply. Curves e, f, g, and h relate to unweighted PCM and weighted PCM
Systems 1, 2, and 3, respectively; the diversity is ISC. We observe
that the gains in s/n for Systems 1, 2, and 3 compared to unweighted
PCM are significantly larger than those for the case of $M = 1$.
Further, the systems are able to operate at a much lower channel s/n,
$\Gamma$, for the same s/n. When we use IMRC diversity the gains in
s/n are similar to those achieved with ISC, but for a given $\Gamma$,
the values of s/n are increased by approximately 2 dB.
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Fig. 5—Rayleigh fading channel. Theoretical curves of overall s/n as a function of 
channel s/n for CPSK and an input power level of —17 dB. 
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Fig. 6—Rayleigh fading channel. Theoretical curves of overall s/n as a function of 
channel s/n for CPSK and an input power level of —40 dB. 
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When the level of the input signal is reduced to —40 dB the curves 
in Fig. 4 are obtained. Curves a to l relate to the same system
parameters as those in Fig. 3. As we expected, the peak s/n (at high
values of $\Gamma$) is reduced by 2 dB due to coarse quantization at
low signal levels. The slopes of the curves in Fig. 4 are lower than
those in Fig. 3. However, the curves of s/n for Systems 2 and 3 are
approximately the same when no diversity is applied. Marginally
greater gains in s/n occur at this lower power level of −40 dB when
second-order diversity reception is employed.

The theoretical curves for CPSK are shown in Figs. 5 and 6. The curves
a to l in Figs. 5 and 6 correspond to those in Figs. 3 and 4,
respectively. The essential difference between the curves is that for
the same s/n, $\Gamma$ is 6 dB lower when CPSK is employed compared to
when NCFSK modulation is used.


6.1.1 Speech input signals 


Speech composed of four concatenated sentences, “Glue the sheet to the
dark blue background,” “Rice is often served in round bowls,” “Four
hours of steady work faced us,” and “The box was thrown beside the
parked truck,” was used as our speech input signal. The first two
sentences were spoken by females, the last two by males. The speech
signal was bandlimited to between 200 and 3200 Hz, sampled at 8 kHz,
and stored in the computer. Eight-bit $\mu$-law PCM encoding ensued,
the bit stream was suitably weighted, and two-level CPSK modulation of
a Radio Frequency (RF) carrier was performed. From
a hardware simulator of frequency-selective Rayleigh-fading mobile 
radio paths? samples of the envelope function C(t) were taken at a 
rate of 32 kHz and stored in the computer to provide the fading 
envelope of a mobile unit traveling at 15 mph. By resampling C(t) 
different vehicle speeds could be simulated. The regenerated /th bit at 
the kth instant was’? 


$$\hat{B}_{(\cdot),l}(k) = C(k) B_{(\cdot),l}(k) + I(k) \ge 0:
\text{logical 1 generated};$$
$$\hat{B}_{(\cdot),l}(k) = C(k) B_{(\cdot),l}(k) + I(k) < 0:
\text{logical 0 generated};$$
$$l = 1, 2, \ldots, N, \tag{77}$$

where $(\cdot)$ is 0, 1, 2, or 3, depending on whether the PCM was
unweighted or System 1, 2, or 3 was used, respectively. The amplitude
of the Rayleigh envelope, the transmitted bit, and the additive
interference level are represented in eq. (77) by $C(k)$,
$B_{(\cdot),l}(k)$, and $I(k)$, respectively. After bit regeneration
the $\mu$-law PCM words were formulated and subsequently decoded. This
decoded speech sequence was subtracted from its counterpart in the
input speech sequence to yield the noise sequence. The s/n was
computed as the ratio of the input
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Fig. 7—Overall s/n as a function of input speech power for zero bit error rate. 
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Fig. 8—Mobile radio channel. Overall s/n versus channel s/n for CPSK and speech 
input signals. The input power level is -17 dB and the vehicular speed is 30 mph. 
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speech power to the noise sequence power. The curve of s/n as a 
function of input speech power is displayed in Fig. 7 for zero BER. 
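The regeneration rule of eq. (77) and the s/n measurement described
above can be sketched in Python as follows. This is a software
stand-in only: the paper took its envelope samples from a hardware
fading simulator, whereas here the envelope is drawn from a seeded
pseudo-random generator, and all names and test signals are
assumptions made for illustration.

```python
import math
import random

def rayleigh_envelope(n, rng):
    """Unit-mean-square Rayleigh samples, a software stand-in for C(k)."""
    return [math.hypot(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) / math.sqrt(2.0)
            for _ in range(n)]

def regenerate_bits(weighted_bits, envelope, interference):
    """Eq. (77): logical 1 when C(k) * B(k) + I(k) >= 0, else logical 0."""
    return [1 if c * b + i >= 0 else 0
            for b, c, i in zip(weighted_bits, envelope, interference)]

def snr_db(original, decoded):
    """Input power over the power of the (original - decoded) noise sequence."""
    signal = sum(x * x for x in original)
    noise = sum((x - y) ** 2 for x, y in zip(original, decoded))
    return 10.0 * math.log10(signal / noise)

rng = random.Random(1984)                     # fixed seed for repeatability
tx = [1.0, -1.0, -1.0, 1.0, 1.0, -1.0]        # weighted bits, sign carries data
envelope = rayleigh_envelope(len(tx), rng)
noiseless = regenerate_bits(tx, envelope, [0.0] * len(tx))
example_snr = snr_db([1.0, -1.0, 1.0, -1.0], [0.9, -0.9, 0.9, -0.9])
```

With no interference the positive envelope cannot change the sign of a
bit, so errors arise only when the interference term outweighs the
faded, weighted bit amplitude.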

Figure 8 shows the performance of our three weighting systems and
unweighted PCM in the absence of diversity and when second-order ISC
diversity was employed for an input level of −17 dB. The corresponding
performance when the vehicular speed was 60 mph is displayed in
Fig. 9. The effects of the nonstationary speech statistics and the
duration of the speech and Rayleigh envelope data conspire to cause
the curves in Figs. 8 and 9 to lose the parallelism of those in
Fig. 5. However, the s/n advantage due to using diversity and our
weighting systems when speech input signals are employed has a close
correspondence with the gains predicted from our theoretical results.
We note that the performance of the system as the vehicular speed
doubles is, in general, very similar, although there are some large
local variations.

Figure 10a shows a segment of speech, while Fig. 10b, c, d, and e show
the recovered speech waveforms for unweighted PCM and weighted PCM
Systems 1, 2, and 3, respectively, when the vehicular speed was
30 mph, with an input level of −17 dB, CPSK modulation, no diversity,
and $\Gamma = 16$ dB. When ISC, $M = 2$, was employed and the other
conditions remained unchanged, the results were the corresponding
recovered speech waveforms in Fig. 10f, g, h, and i. We observe that
employing Systems 2 and 3 results in a considerable reduction in
digital noise power compared to the conventional unweighted PCM.

Fig. 9—Mobile radio channel. Overall s/n versus channel s/n for CPSK
and speech input signals. The input power level is −17 dB and the
vehicular speed is 60 mph.

Fig. 10—Segment of the speech signal. Overall s/n versus channel s/n
for CPSK and speech input signals. The input power level is −17 dB and
the vehicular speed is 30 mph. The channel s/n is 16 dB and the
original speech segment is waveform i.


6.1.2 Average error probability 


We assert in Section 4.1 that for weighted PCM systems the average bit
error probability $P_w$ is not a suitable performance parameter. To
lend credence to this statement we display in Fig. 11 the curves of
error probability for the MSB, next MSB, ..., LSB, namely
$p_1, p_2, \ldots, p_8$, respectively, as a function of channel s/n,
for Weighted PCM System 1. Also shown is the variation of the
probability of bit error $P$ for a conventional, unweighted PCM
system. The results are for a Rayleigh fading channel, no diversity,
and an input level of −40 dB. The weighting strategy for System 1
arranges for the LSB to have a lower transmitted energy than the other
bits, and, consequently, a greater probability of error. However, an
error in the LSB causes the smallest contribution to the digital noise
power compared to the contributions from the other bits. We observe
that $p_8 > p_7 > p_6 > p_5 > p_4 > p_3 > p_2$, i.e., the larger the
contribution to the digital noise power caused by the regeneration of
an erroneous bit, the lower is its probability of being in error.
Probability $p_2$ is associated with the most significant magnitude
bit of the quantized sample, and we see that $p_2 < p_1$. This
inequality originates in the optimization process, indicating that an
error in the most significant magnitude bit will, in general, generate
more digital noise than an error in the polarity bit of the
reconstituted sample. The probability of error $P$ for the unweighted
PCM is seen to be marginally smaller than $p_1$, but significantly
greater than $p_2$.

Fig. 11—Curves of probability of error as a function of channel s/n
for the individual probability of bit errors $p_1, p_2, \ldots, p_8$
for System 1. It is a Rayleigh fading channel with no diversity, and
an input level of −40 dB. Also displayed is the probability of error
$P$ for the unweighted PCM system as a function of channel s/n.

When the average probability $P_{av}$ is computed according to eq. (35),
we have the curve displayed in Fig. 12. As we expected from the curves
in Fig. 11, $P_{av}$ for Weighted PCM System 1 is considerably in excess
of the probability P for unweighted PCM. Also shown in Fig. 12 are
$P_{av}$ for Weighted PCM Systems 2 and 3. From our previous figures
displaying s/n, and those depicting error probability in Figs. 11 and 12, we conclude
that systems that provide superior s/n also have inferior $P_{av}$. Hence
we reject $P_{av}$ as a meaningful performance parameter. However, knowl-
edge of $P_{av}$ does contribute to our insight into weighted PCM systems.


6.2 Results for the Gaussian channel 


When the fixed weighting profile described in Section 5.1 is used,
the digital noise power is given by eq. (60). The optimum weights $\phi_l$
of eq. (58), having the same $A_l$ factors as used in the unweighted case,
are tabulated in Table III. However, we observe that $\phi_l$ is a function
of the channel s/n. Applying the same procedure as for the theoretical
results in Section 6.1, we change the weighting $\phi_l$ with channel s/n to
ensure that the digital noise power $\epsilon^2$ is minimized. The curves in
Figs. 13 and 14 are therefore optimum for every channel s/n. The
improvements attained in s/n for Weighted PCM Systems 1, 2, and 3
compared to unweighted PCM are 10, 12, and 17 dB, respectively, for an
input signal level of −17 dB when the channel s/n is 10 dB. The optimum
fixed word weighting profile of System 1 enables the channel s/n to deteriorate
by 1.5 and 2 dB for input signal levels of −17 and −40 dB, respectively,
compared to unweighted PCM, while s/n is maintained at 30 dB.
Observe that at low values of channel s/n where the BER is high, the
application of fixed word weighting can increase s/n by 10 to 20 dB.
These large gains can transform unintelligible speech into distorted
but partially intelligible speech. We note that the gain in s/n when
the individual word weighting profile is used instead of the fixed
weighting profile is only 1 to 2 dB.

The digital noise power $\epsilon^2$ for the individual word weighting profiles
of System 2 is given by eq. (65), where the $a_l(x_i)$ factors are given in
Ref. 7. The individual word weighting profiles of eq. (63) are presented
in Fig. 15. We used these weighting profiles to generate the curves for
Weighting System 2 in Figs. 13 and 14.

Fig. 12—Curves of average probability for Systems 1, 2, and 3, and the probability of
error P for unweighted PCM, as a function of channel s/n.


Table III—The constant word weighting profile $\{\phi_l\}_{l=1}^{N}$ for the
Gaussian channel (optimized for a channel s/n of 11 dB)

Input                          Bit Number l
Power      1       2       3       4       5       6       7       8
−17 dB   1.4612  1.5493  1.4763  1.1782  0.9213  0.6930  0.4707  0.2500
−40 dB   1.3247  2.0489  1.3994  1.1217  0.8643  0.6355  0.4131  0.1924


Fig. 13—Gaussian channel. Theoretical curves of overall s/n versus channel s/n for
an input level of −17 dB. Curves a, b, c, and d relate to no weighting, weighting scheme
1, 2, and 3, respectively. No diversity was employed.

Fig. 14—Gaussian channel. Theoretical curves of overall s/n versus channel s/n for
an input level of −40 dB. Curves a, b, c, and d relate to no weighting, weighting scheme
1, 2, and 3, respectively. No diversity was employed.

Fig. 15—Gaussian channel. The weighting profiles of Weighting System 2. Subfigures
a, b, ..., h correspond to a single bit error in the polarity bit, the most significant
magnitude bit, ..., the least significant magnitude bit, respectively.


Table IV—The word weighting factors $W_i$ for the Gaussian channel
(optimized for channel s/n of 11 dB)

Input Power = −17 dB

0.0785  0.1333  0.1648  0.1893  0.2105  0.2297  0.2476  0.2645  0.2807
0.2963  0.3116  0.3264  0.3410  0.3553  0.3694  0.3833  0.3971  0.4107
0.4242  0.4376  0.4509  0.4641  0.4772  0.4903  0.5033  0.5162  0.5291
0.5419  0.5547  0.5674  0.5801  0.5927  0.6054  0.6179  0.6305  0.6430
0.6555  0.6680  0.6805  0.6929  0.7053  0.7177  0.7301  0.7424  0.7548
0.7671  0.7794  0.7917  0.8040  0.8163  0.8286  0.8408  0.8531  0.8653
0.8775  0.8898  0.9020  0.9142  0.9264  0.9385  0.9507  0.9629  0.9751
0.9872  0.9994  1.0116  1.0237  1.0358  1.0480  1.0601  1.0723  1.0844
1.0965  1.1086  1.1207  1.1329  1.1450  1.1571  1.1692  1.1813  1.1934
1.2055  1.2176  1.2297  1.2418  1.2538  1.2659  1.2780  1.2901  1.3022
1.3143  1.3263  1.3384  1.3505  1.3626  1.3746  1.3867  1.3988  1.4108
1.4229  1.3450  1.4470  1.4591  1.4712  1.4832  1.4953  1.5074  1.5194
1.5315  1.5435  1.5556  1.5677  1.5797  1.5918  1.6038  1.6159  1.6279
1.6400  1.6521  1.6641  1.6762  1.6882  1.7003  1.7123  1.7244  1.7364
1.7485  1.7605

Input Power = −40 dB

0.6457  0.7005  0.7320  0.7566  0.7778  0.7970  0.8148  0.8317  0.8479
0.8636  0.8788  0.8937  0.9082  0.9226  0.9367  0.9506  0.9644  0.9870
0.9915  1.0049  1.0182  1.0314  1.0445  1.0576  1.0705  1.0835  1.0963
1.1092  1.1219  1.1347  1.1473  1.1600  1.1726  1.1852  1.1978  1.2103
1.2228  1.2353  1.2477  1.2602  1.2726  1.2850  1.2973  1.3097  1.3220
1.3344  1.3467  1.3590  1.3713  1.3836  1.3958  1.4081  1.4203  1.4326
1.4448  1.4570  1.4692  1.4814  1.4936  1.5058  1.5180  1.5302  1.5423
1.5545  1.5667  1.5788  1.5910  1.6031  1.6152  1.6274  1.6395  1.6516
1.6638  1.6759  1.6880  1.7001  1.7122  1.7243  1.7364  1.7485  1.7606
1.7727  1.7848  1.7969  1.8090  1.8211  1.8332  1.8453  1.8574  1.8694
1.8815  1.8936  1.9057  1.9178  1.9298  1.9419  1.9540  1.9660  1.9781
1.9902  2.0022  2.0143  2.0264  2.0384  2.0505  2.0626  2.0746  2.0867
2.0987  2.1108  2.1229  2.1349  2.1470  2.1590  2.1711  2.1832  2.1952
2.2073  2.2193  2.2314  2.2434  2.2555  2.2675  2.2796  2.2916  2.3037
2.3157  2.3278

When the scheme of Section 5.3 is used, the s/n is computed using
eqs. (69) and (29). The word weighting factors are listed in Table IV.
The gain in s/n when the individual word weighting factor $W_i$ is
introduced is substantial, as shown by the performance curves of
Weighting System 3. These gains in Figs. 13 and 14 exceed 20 dB
compared to unweighted PCM when the input signal power is −40 dB
and the channel s/n falls below 12 dB.
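
The word weighting factor of System 3 can be illustrated numerically. The following Python sketch evaluates the Gaussian-channel closed form of Appendix D, $W_i = 1 + (2N_0/E)\{\ln[N a_0(x_i)] - \sum_k p_k \ln[N a_0(x_k)]\}$. The function name, the word probabilities, and the $a_0$ values are hypothetical placeholders chosen only for illustration; they are not the data behind Table IV.

```python
import math

def word_weighting_factors(p, a0, n_bits, e_over_n0_db):
    """Return W_i for each PCM word (sketch of the Appendix D closed form).

    p            : word probabilities p_i (assumed to sum to 1)
    a0           : geometric-mean noise factors a_0(x_i), one per word (hypothetical)
    n_bits       : number of bits N per PCM word
    e_over_n0_db : channel s/n E/N0 in dB
    """
    g = 10.0 ** (e_over_n0_db / 10.0) / 2.0        # E / (2 N0), linear scale
    logs = [math.log(n_bits * a) for a in a0]       # ln[N a_0(x_i)]
    mean_log = sum(pi * li for pi, li in zip(p, logs) if pi > 0.0)
    # Words that never occur receive zero weight, as noted in Appendix D.
    return [1.0 + (li - mean_log) / g if pi > 0.0 else 0.0
            for pi, li in zip(p, logs)]

# Illustrative (made-up) inputs: three words with increasing noise factors.
p_demo = [0.5, 0.3, 0.2]
a0_demo = [1.0, 4.0, 16.0]
w_demo = word_weighting_factors(p_demo, a0_demo, n_bits=8, e_over_n0_db=11.0)
```

Under these assumptions the factors satisfy the average-energy constraint $\sum_i p_i W_i = 1$, and words whose errors contribute more noise receive larger weights.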


VII. DISCUSSION 


In another paper’ we described the effect of transmission errors on
μ-law PCM encoded signals transmitted over Gaussian and Rayleigh
fading channels. The essential theoretical results of that work are
condensed here into Sections 4.6 and 5.4, and are employed as bench-
marks for measuring the gains in overall s/n when the μ-law PCM
bits are weighted prior to transmission. Weighting System 1 is not
novel. It is the original scheme due to Bedrosian¹ that has been
analyzed by him and Sundberg.” However, the results for transmission
of weighted μ-law PCM by System 1 over Rayleigh fading channels
are new. Systems 2 and 3 mark a basic departure from previous
proposals for weighting. In System 2 we introduce the concept of
instantaneous weighting, where the bits are weighted by a profile
dependent on the quantization level. Thus there are $2^N$ different
weighting profiles, where N is the number of bits in the PCM words,
and these weighting profiles can be conveniently stored in a ROM.
Observing that the digital noise power is dependent on the actual
PCM words generated leads us to the notion of assembling the bits
weighted by System 2 into their binary words and then weighting each
bit in the word by a value unique to the word.

We have presented theoretical results of overall s/n as a function of 
channel s/n for all our weighting schemes when the transmission is 
over Rayleigh and Gaussian channels, the modulation NCFSK and 
CPSK, and when ISC and IMRC diversity is employed. The overall 
s/n, i.e., the s/n of the recovered signal, is a meaningful objective 
measure of system performance. By contrast BER is a misleading 
performance parameter in weighted PCM. Indeed, the overall s/n 
generally increases when the BER increases. This is because the large 
contributions to the average bit error probability arise from the least 
significant bits that contribute least to the digital noise power. 

In addition to the provision of theoretical equations and graphs of 
overall s/n, we have provided simulation results when the input signal 
was composed of four concatenated speech sentences. The simulations 
were limited to two-level CPSK modulation and apply to a mobile 
radio channel. However, the simulations encompassed all the various 
weighting schemes, unweighted μ-law PCM, no diversity, and ISC,
M = 2, diversity. The results for speech agree satisfactorily with our 
theory, and the waveform presentation of Fig. 10 shows that the 
enhancement of the recovered waveforms has a matching correspond- 
ence with the measured gains in s/n. 

In summary we make the following observations. For the Rayleigh
fading channel, and in the absence of diversity, Weighting Systems 1,
2, and 3 yielded gains of s/n over unweighted μ-law PCM of approxi-
mately 3, 4.5, and 6 dB, respectively, when the input power was −17 dB. This was
irrespective of whether the modulation was NCFSK or CPSK. At the
lower input level of −40 dB, Systems 1 and 2 had the same performance
with a gain in s/n of 6.5 dB. System 3 enhanced the performance by
approximately 8 dB. Second-order diversity significantly extends the
range of channel s/n before s/n begins to deteriorate. This is, of course,
well known, as is the ability of IMRC diversity to outperform ISC in
terms of s/n for a given channel s/n. What was not known was the
behavior of these diversity schemes when weighted PCM was intro-
duced. We found that the relative performance of the three weighting
systems with no diversity was repeated with diversity, enabling us to
draw the general conclusion that System 3 always performs best, and
that at low power levels there is no advantage to using System 2
compared to System 1.

For the Gaussian channel the improvement in s/n for Systems 1, 2, and
3 compared to unweighted μ-law PCM was 10, 12, and 17 dB, respec-
tively, for a channel s/n of 10 dB, and an input level of −17 dB. Thus
an s/n for unweighted μ-law PCM of 13 dB was transformed to 30 dB,
a value often associated with toll quality s/n. At the lower input level
of −40 dB the gains for Systems 1, 2, and 3 become 20, 21, and 24 dB,
respectively, i.e., Systems 1 and 2 had virtually the same performance,
while System 3 only yielded a gain of 3 dB over System 2.


APPENDIX A
Optimum Weighting Profile When System 1 Is Operating Over Rayleigh 
Fading Channels 

Consider Weighting System 1 operating with M-fold ideal maximal
ratio diversity and NCFSK. The optimum weights that minimize the
digital noise power $\epsilon^2$ for this system are found by means of the
Lagrange multiplier technique,” viz:

\[
F = F(\phi_1, \ldots, \phi_N, \lambda)
  = \sum_{l=1}^{N} A_l \frac{2^{M-1}}{(2 + \phi_l \Gamma)^{M}}
  + \lambda \left( \sum_{l=1}^{N} \phi_l - N \right), \tag{78}
\]

where $\lambda$ is a dummy variable. Partially differentiating F with respect
to $\phi_l$ and $\lambda$ and setting these derivatives to zero we have

\[
A_l \frac{M 2^{M-1} \Gamma}{(2 + \phi_l \Gamma)^{M+1}} = \lambda. \tag{79}
\]

Using the constraint $\sum_{l=1}^{N} \phi_l = N$, the $\lambda$ value is given by

\[
\lambda = M 2^{M-1} \Gamma
  \left[ \frac{\sum_{l=1}^{N} A_l^{1/(M+1)}}{N(2 + \Gamma)} \right]^{M+1}. \tag{80}
\]

Using expressions (79) and (80), we arrive at the optimum weights
(33).

Note that for the reason of simplicity of analysis we did not impose
the physical condition $\phi_l \ge 0$. In the solution to the mathematical
optimization problem, optimum mathematical weights might be neg-
ative for low channel s/n's.

The optimum weights (33) give the minimum digital noise power

\[
\epsilon^2_{\min} = 2^{M-1} \left[ \frac{1}{N(2 + \Gamma)} \right]^{M}
  \left( \sum_{l=1}^{N} A_l^{1/(M+1)} \right)^{M+1}, \tag{81}
\]

where the optimum weight profile is used at each s/n $\Gamma$. For a fixed
weight profile, (81) is a lower bound for those $\Gamma$'s, where the fixed
weight profile is not optimized.

The optimum weights for Systems 2 and 3 can also be found by 
means of the Lagrange multiplier method. In Appendices C and D we 
have shown the detailed derivations for the Gaussian channel. 
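
As a numerical illustration of the Rayleigh-channel optimization in this appendix, the Python sketch below assumes that the error probability of bit l with NCFSK and M-fold ideal maximal ratio diversity is $2^{M-1}/(2 + \phi_l \Gamma)^M$, so that the Lagrange conditions give $2 + \phi_l \Gamma = N(2 + \Gamma) A_l^{1/(M+1)} / \sum_k A_k^{1/(M+1)}$. That error model, the function names, and the $A_l$ values are assumptions made for this example only.

```python
def rayleigh_mrc_weights(A, gamma, m):
    """Weights phi_l minimizing sum_l A_l * 2**(m-1) / (2 + phi_l*gamma)**m
    subject to sum_l phi_l = N (assumed NCFSK, M-fold ideal MRC model)."""
    n = len(A)
    roots = [a ** (1.0 / (m + 1)) for a in A]     # A_l^(1/(M+1))
    s = sum(roots)
    return [((2.0 + gamma) * n * r / s - 2.0) / gamma for r in roots]

def digital_noise(A, weights, gamma, m):
    """Digital noise power for a given weight profile under the same model."""
    return sum(a * 2.0 ** (m - 1) / (2.0 + w * gamma) ** m
               for a, w in zip(A, weights))

# Hypothetical noise factors for a 4-bit word; channel s/n of 20 dB (linear 100).
A_demo = [64.0, 16.0, 4.0, 1.0]
w_opt = rayleigh_mrc_weights(A_demo, gamma=100.0, m=1)
noise_opt = digital_noise(A_demo, w_opt, gamma=100.0, m=1)
noise_flat = digital_noise(A_demo, [1.0] * 4, gamma=100.0, m=1)

# At a low channel s/n the unconstrained solution can go negative.
w_low = rayleigh_mrc_weights(A_demo, gamma=1.0, m=1)
```

With these made-up inputs the optimized profile sums to N, gives a lower noise power than equal weights, and at a channel s/n of 0 dB produces negative weights for the least significant bits, which is the behavior noted above for low channel s/n.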


APPENDIX B 


Optimum Weighting Profile When System 1 Is Operating Over Gaussian 
Channels 


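Applying the constraint that the average bit energy is constant, we
determine the optimum weights that minimize $\epsilon^2$ with the aid of the
Lagrange multiplier technique,” viz:

\[
F = F(\phi_1, \phi_2, \ldots, \phi_N, \lambda)
  = \sum_{l=1}^{N} A_l \frac{1}{2} e^{-\phi_l E / 2N_0}
  + \lambda \left( \sum_{l=1}^{N} \phi_l - N \right), \tag{82}
\]

where $\lambda$ is a dummy variable. Partially differentiating F with respect
to the lth weight,

\[
\frac{\partial F}{\partial \phi_l}
  = -A_l \frac{E}{4N_0} e^{-\phi_l E / 2N_0} + \lambda, \tag{83}
\]

and upon setting $\partial F / \partial \phi_l$ to zero we have

\[
\lambda = A_l \frac{E}{4N_0} e^{-\phi_l E / 2N_0}, \tag{84}
\]

a constant. Writing eq. (84) as

\[
\ln(\lambda) = \ln(A_l) + \ln\left( \frac{E}{4N_0} \right) - \frac{\phi_l E}{2N_0}
\]

and after considering all N values of l, we get

\[
N \ln(\lambda) = \sum_{l=1}^{N} \ln(A_l) + N \ln\left( \frac{E}{4N_0} \right)
  - \frac{E}{2N_0} \sum_{l=1}^{N} \phi_l. \tag{85}
\]

Applying the constraint of eq. (22) yields

\[
\lambda = \frac{E}{4N_0} e^{-E / 2N_0} \left[ \prod_{l=1}^{N} A_l \right]^{1/N}. \tag{86}
\]

Defining

\[
A_0 = \left[ \prod_{l=1}^{N} A_l \right]^{1/N} \tag{87}
\]

and equating eqs. (84) and (86), we have the optimum fixed word
weighting profile,

\[
\phi_l = 1 + \frac{\ln(A_l / A_0)}{E / 2N_0}; \quad l = 1, 2, \ldots, N. \tag{88}
\]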
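
The fixed profile of this appendix, $\phi_l = 1 + \ln(A_l/A_0)/(E/2N_0)$ with $A_0$ the geometric mean of the $A_l$, also makes it easy to check numerically why the average bit error probability is a poor figure of merit. The Python sketch below is only an illustration under stated assumptions: the bit error probability is taken as $\frac{1}{2} e^{-\phi_l E/2N_0}$, and the $A_l$ values are hypothetical powers of four rather than the μ-law factors used in the paper.

```python
import math

def optimum_fixed_weights(A, e_over_n0_db):
    """phi_l = 1 + ln(A_l / A_0) / (E / 2N0), with A_0 the geometric mean of A."""
    g = 10.0 ** (e_over_n0_db / 10.0) / 2.0            # E / (2 N0), linear scale
    log_a0 = sum(math.log(a) for a in A) / len(A)       # ln A_0
    return [1.0 + (math.log(a) - log_a0) / g for a in A]

def bit_error_probs(weights, e_over_n0_db):
    """Assumed per-bit error probability 0.5 * exp(-phi_l * E / 2N0)."""
    g = 10.0 ** (e_over_n0_db / 10.0) / 2.0
    return [0.5 * math.exp(-w * g) for w in weights]

def digital_noise(A, probs):
    """Digital noise power modeled as sum_l A_l * p_l."""
    return sum(a * p for a, p in zip(A, probs))

# Hypothetical noise factors for an 8-bit word, MSB first; channel s/n of 10 dB.
A_demo = [4.0 ** (8 - l) for l in range(1, 9)]
snr_db = 10.0
phi = optimum_fixed_weights(A_demo, snr_db)
p_weighted = bit_error_probs(phi, snr_db)
p_flat = bit_error_probs([1.0] * 8, snr_db)

noise_weighted = digital_noise(A_demo, p_weighted)
noise_flat = digital_noise(A_demo, p_flat)
ber_weighted = sum(p_weighted) / 8
ber_flat = sum(p_flat) / 8
```

In this example the weighted profile lowers the digital noise power while raising the average bit error probability, and every bit ends up contributing the same amount $A_l p_l$ to the noise.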





APPENDIX C 


Optimum Weighting Profile When System 2 Is Operating Over Gaussian 
Channels 


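Applying eq. (26) to eq. (56), and upon using the Lagrange multiplier
technique, we have the function

\[
F = F(\phi_1(x_0), \ldots, \phi_l(x_0), \ldots, \phi_N(x_0), \ldots,
      \phi_l(x_i), \ldots, \phi_N(x_{2^N-1}),
      \lambda_0, \lambda_1, \ldots, \lambda_i, \ldots, \lambda_{2^N-1}) \tag{89}
\]

\[
  = \sum_{i=0}^{2^N-1} \sum_{l=1}^{N} p_i a_l(x_i) \frac{1}{2} e^{-\phi_l(x_i) E / 2N_0}
  + \sum_{i=0}^{2^N-1} \lambda_i \left[ \sum_{l=1}^{N} \phi_l(x_i) - N \right]. \tag{90}
\]

Upon partially differentiating F with respect to $\phi_l(x_i)$,

\[
\frac{\partial F}{\partial \phi_l(x_i)}
  = -p_i a_l(x_i) \frac{E}{4N_0} e^{-\phi_l(x_i) E / 2N_0} + \lambda_i, \tag{91}
\]

and setting $\partial F / \partial \phi_l(x_i)$ to zero we get

\[
\frac{\lambda_i}{p_i} = a_l(x_i) \frac{E}{4N_0} e^{-\phi_l(x_i) E / 2N_0}, \tag{92}
\]

a constant for each fixed i and l = 1, 2, ..., N. Applying the constraint
on $\phi_l(x_i)$, and following the procedure used in the derivation of eq.
(88), we have the optimum individual word weighting profile,

\[
\phi_l(x_i) = 1 + \frac{\ln\left[ a_l(x_i) / a_0(x_i) \right]}{E / 2N_0};
  \quad l = 1, 2, \ldots, N; \quad i = 0, 1, 2, \ldots, 2^N - 1, \tag{93}
\]

where

\[
a_0(x_i) = \left( a_1(x_i)\, a_2(x_i) \cdots a_N(x_i) \right)^{1/N}
         = \left[ \prod_{l=1}^{N} a_l(x_i) \right]^{1/N}. \tag{94}
\]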


APPENDIX D 


The Optimum Word Weighting Factor When System 3 Is Operating Over 
Gaussian Channels 


In this case the Lagrange multiplier technique yields

\[
F = F(W_0, W_1, W_2, \ldots, W_{2^N-1}, \lambda)
  = \sum_{i=0}^{2^N-1} p_i N a_0(x_i) \frac{1}{2} e^{-W_i E / 2N_0}
  + \lambda \left( \sum_{i=0}^{2^N-1} p_i W_i - 1 \right), \tag{95}
\]

whence

\[
\frac{\partial F}{\partial W_i}
  = p_i N a_0(x_i) \frac{1}{2} e^{-W_i E / 2N_0} \left( -\frac{E}{2N_0} \right) + \lambda p_i.
\]

Upon setting $\partial F / \partial W_i$ equal to zero,

\[
\lambda = N a_0(x_i) \frac{E}{4N_0} e^{-W_i E / 2N_0};
  \quad i = 0, 1, 2, \ldots, 2^N - 1, \tag{96}
\]

a constant. We have assumed that

\[
p_i \ne 0; \quad i = 0, 1, 2, \ldots, 2^N - 1,
\]

but if any $p_i$ is zero, the corresponding word weight is set to zero, there
being no advantage in reserving energy for an event that never occurs.
When some values of $p_i$ are zero the minimization is performed as
above, but over those $W_i$'s (fewer than $2^N$) that have nonzero $p_i$.

Taking the natural logarithm of eq. (96) and applying the constraint
of eq. (43) yields

\[
\ln(\lambda) = \sum_{i=0}^{2^N-1} p_i \ln[N a_0(x_i)]
  + \ln\left( \frac{E}{4N_0} \right) - \frac{E}{2N_0}. \tag{97}
\]

Thus from eqs. (96) and (97), the optimum word weighting factor
becomes

\[
W_i = 1 + \frac{1}{E / 2N_0}
  \left\{ \ln[N a_0(x_i)] - \sum_{k=0}^{2^N-1} p_k \ln[N a_0(x_k)] \right\};
  \quad i = 0, 1, 2, \ldots, 2^N - 1. \tag{98}
\]


Recent work at AT&T Bell Laboratories has shown how the theories of
Vector Quantization (VQ) and Hidden Markov Modeling (HMM) can be 
applied to the recognition of isolated word vocabularies. The initial experi- 
ments with an HMM word recognizer were restricted to a vocabulary of 10 
digits. For this simple vocabulary with dialed-up telephone recordings, we 
found that a high-performance, speaker-independent word recognizer could be 
implemented, and that the performance was, for the most part, insensitive to 
parameters of both the HMM and the VQ. In this paper we extend our 
investigations of the HMM recognizer to the recognition of isolated words 
from a medium-size vocabulary (129 words), as used in the AT&T Bell 
Laboratories airlines reservation and information system. For this moderately 
complex word vocabulary, we have found that recognition accuracy is indeed 
a function of the HMM parameters (i.e., the number of states in the model 
and the number of symbols per state). We have also found that a VQ that 
includes energy information gives better performance than a conventional VQ 
of the same size (i.e., same number of code-book entries). 


1. INTRODUCTION 


Vector Quantizers (VQs), Linear Predictive Coding (LPC) coeffi- 
cients, and Hidden Markov Models (HMMs) have been shown to be 
useful for a wide range of speech processing problems in the areas of
coding, synthesis, and recognition.’ ” In the area of speech recognition,
there have been two distinctly different ways of applying HMMs. In 
the earliest work, extremely ambitious large-scale network models 
were used to model continuous discourse with constrained vocabular- 
ies.””® For these large networks, HMMs were derived for basic speech 
sounds (e.g., phonemes), and words were made by coupling together 
the individual sound HMMs. Similarly, a sentence was made by 
coupling the models of all the words in the sentence. Solving such 
large-scale networks (i.e., finding the best path through the network 
in order to decode the sentence) was a major problem and required 
very sophisticated network search routines to find good (generally 
suboptimal) solutions. 

More recently, work has been carried out on speech recognition 
based on HMMs for individual, isolated words.” For this type of 
system, an HMM is designed for each word in the vocabulary, and 
recognition is carried out by evaluating the probability that an un- 
known test pattern is the output of a given word model. The reference 
word whose model has the highest probability is chosen as the recog- 
nized word. To date, the word-based HMM recognizer has been tested 
only on a vocabulary of the 10 digits. For this vocabulary, it was found 
that the overall performance of the HMM recognizer was fairly insen- 
sitive to the parameters of both the HMM and the VQ, and that 
average word recognition accuracy was approximately the same as 
that obtained from a conventional Dynamic Time Warping (DTW) 
template approach. 

Based on the success of the word-based HMM recognizer for the 
digits vocabulary, we have attempted to extend the approach to handle 
a fairly complex, medium-size word vocabulary. The vocabulary we 
have chosen is the 129-word airlines terms vocabulary, which has been 
extensively studied in the context of an airlines reservation and 
information system.!* "© Table I shows the words in this vocabulary. 
We have studied the effects of varying several of the parameters of 
the HMM and the VQ on recognizer performance. Results indicate 
that a significant degradation in average word recognition accuracy is 
introduced by the use of a VQ, even with as many as 256 vectors in 
the code book. Results also show that the HMM/VQ system can and 
does achieve average word recognition accuracy comparable to a 
DTW/VQ template-based recognizer. Finally, we have found that, for 
a given size of code book, VQs using energy, along with LPC shape, 
give better performance than VQs with LPC shape alone. 

The organization of this paper is as follows. In Section II we briefly
review the design of the conventional DTW recognizer, the design of
a VQ based on either LPC shape alone or LPC shape plus energy, and
the operation of the HMM recognizer. In Section III we describe the
experimental evaluation used to measure performance of the various
recognizers on the 129-word airlines vocabulary. In particular, we give
a detailed analysis (for the two best recognition systems) of the
individual talker error rates, the most prevalent types of word errors,
and the extent to which errors made by DTW and HMM recognizers
are disjoint. Finally, in Section IV we summarize our findings.


Table I—Words in airlines vocabulary

A             A.M.          Afternoon
American      April         Are
Area          Arrival       Arrive
At            August        B.A.C.
Boeing        Boston        By
Card          Cash          Charge
Chicago       Class         Club
Coach         Code          Credit
D.C.          December      Denver
Depart        Departure     Detroit
Diners        Do            Does
Douglas       Eight         Eleven
Evening       Express       Fare
February      First         Five
Flight        Flights       For
Four          Friday        From
Go            Home          How
I             In            Information
Is            January       July
June          Leave         Like
Lockheed      Los-Angeles   Make
Many          March         Master
May           Meal          Miami
Monday        Morning       Much
My            Need          New York
Night         Nine          Non stop
November      Number        O’clock
October       Of            Office
Oh            On            One
P.M.          Pay           Philadelphia
Phone         Plane         Please
Prefer        Repeat        Reservation
Return        Saturday      Seat
Seats         Seattle       September
Served        Seven         Six
Some          Stops         Sunday
Take          Ten           The
Thee          There         Three
Thursday      Time          Times
To            Tuesday       Twelve
Two           Uh            Want
Washington    Wednesday     What
When          Will          Would


II. THE DTW AND HMM/VQ WORD RECOGNIZERS


The two types of word recognition systems that we will be concerned 
with are:

1. A conventional DTW recognizer (either with or without VQ)
based on LPC modeling and using isolated word templates as reference 
patterns. 

2. An HMM recognizer with quantized LPC vectors, using single- 
word Markov models as parametric representations of the words in 
the vocabulary. 

In this section we briefly review the implementations of the VQ, the 
DTW recognizer, and the HMM recognizer. 


2.1 VQ of LPC parameters 


Vector quantization is a technique for coding an LPC vector into 
one of M* code-book entries such that the average quantization 
distortion is minimized over some typical training set of LPC vectors. 
Vector quantization differs from the more conventional scalar quan- 
tization methods in that the entire LPC vector is quantized in a single 
pass, rather than quantizing each component of the vector by a 
separate quantizer. Experience indicates a substantial savings in re- 
quired code-book size (bit rate) for VQ over conventional scalar 
quantization.’* For HMM recognition purposes, VQ serves as a way 
of characterizing continuous LPC vectors by a set of discrete symbols 
(i.e., the indices of the code-book entries that provide best matches to 
the input LPC vectors to the recognizer). In addition, for the DTW 
recognizer, VQ provides a simple and straightforward way of trading 
off storage and computation in the calculation of LPC distances as 
required in the DTW algorithm.®” 

A VQ is designed from a training set of I LPC vectors, $a_i$, i = 1, 2,
..., I, which are intended to be a good representation of the range of
LPC vectors that occur when the words in the vocabulary are pro-
nounced by a wide range of talkers. The VQ training algorithm
determines an optimum set of code-book LPC vectors, $\hat{a}_m$, m = 1, 2,
..., M*, such that, for a given M*, the average distortion in replacing
each of the training set vectors, $a_i$, by the closest code-book entry, $\hat{a}_m$,
is minimum.

If we define $d(a_R, a_T)$ as the conventional LPC distance?” between
the LPC vectors $a_R$ and $a_T$, i.e.,

\[
d(a_R, a_T) = \frac{a_R V_T a_R'}{a_T V_T a_T'} - 1, \tag{1}
\]

where $V_T$ is the autocorrelation matrix of the sequence that gave rise
to LPC vector $a_T$, then the goal of the VQ training algorithm is to find
the set (of code-book entries), $\{\hat{a}_m\}$, such that

\[
\| D_{M^*} \| = \min_{\{\hat{a}_m\}} \left[ \frac{1}{I} \sum_{i=1}^{I}
  \min_{1 \le m \le M^*} d(\hat{a}_m, a_i) \right] \tag{2}
\]

is satisfied. The quantity $\| D_{M^*} \|$ is the (minimum) average distortion
of the VQ with M* code-book entries.

The way in which eq. (2) is solved has been intensively investigated 
by several researchers’ and will not be described here. As a result of 
these earlier studies, a highly reliable and robust procedure for the 
design of a VQ code book exists and can readily be implemented. 
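
The quantity inside eq. (2) is straightforward to evaluate for a given code book. The Python sketch below shows the nearest-entry search and the resulting average distortion over a training set. The function names are illustrative, and the squared Euclidean distance in the demonstration is only a stand-in for the LPC distance of eq. (1); any callable `dist(codeword, vector)` can be supplied instead. It does not attempt the code-book design itself.

```python
def nearest_codeword(vec, codebook, dist):
    """Index m of the code-book entry that minimizes dist(codebook[m], vec)."""
    return min(range(len(codebook)), key=lambda m: dist(codebook[m], vec))

def average_distortion(training, codebook, dist):
    """Average over the training set of the distance to the closest entry."""
    total = sum(dist(codebook[nearest_codeword(v, codebook, dist)], v)
                for v in training)
    return total / len(training)

def squared_euclidean(x, y):
    """Stand-in distance for the demonstration (not the LPC distance)."""
    return sum((a - b) ** 2 for a, b in zip(x, y))

# Toy data: three training vectors and a two-entry code book.
training_demo = [[0.0, 0.0], [1.0, 1.0], [10.0, 10.0]]
codebook_demo = [[0.0, 0.0], [10.0, 10.0]]
symbols_demo = [nearest_codeword(v, codebook_demo, squared_euclidean)
                for v in training_demo]
distortion_demo = average_distortion(training_demo, codebook_demo, squared_euclidean)
```

The list of indices produced by `nearest_codeword` is the kind of discrete symbol sequence that the HMM recognizer takes as its observation sequence.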

Although the conventional VQ (which we call a shape VQ) used
only the LPC vector, recent studies have shown how normalized frame
energy can be incorporated directly into the local distance [eq. (1)]*®
to give a shape plus energy VQ. For these designs we denote the LPC
distance of eq. (1) as $d_{LPC}(a_R, a_T)$ and we denote an energy distance
as $d_E(E_R, E_T)$. The total distance is then

\[
d(a_R, a_T) = d_{LPC}(a_R, a_T) + \alpha\, d_E(E_R, E_T), \tag{3}
\]

where $E_R$ and $E_T$ are log energies of the reference and test frames,
normalized to the peak energies in the word, and $\alpha$ is a multiplier for
giving appropriate weight to the energy distance. Using the modified
frame distance of eq. (3), a shape plus energy VQ can be designed
according to the criteria of eq. (2) with no further modification.
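
A small Python sketch of the shape plus energy distance of eq. (3) follows. The text does not give the form of the energy distance, so the absolute difference of normalized log energies used here, the decibel scaling, and the function names are all assumptions made for illustration; frame energies are assumed to be strictly positive.

```python
import math

def normalized_log_energies(frame_energies):
    """Log energy of each frame, in dB relative to the peak frame in the word."""
    peak = max(frame_energies)
    return [10.0 * math.log10(e / peak) for e in frame_energies]

def shape_plus_energy_distance(d_lpc, e_ref, e_test, alpha,
                               d_energy=lambda er, et: abs(er - et)):
    """Total distance: LPC shape distance plus alpha times an energy distance.

    d_lpc    : shape distance already computed for the two frames
    e_ref    : normalized log energy of the reference frame
    e_test   : normalized log energy of the test frame
    d_energy : energy distance (hypothetical default: absolute difference)
    """
    return d_lpc + alpha * d_energy(e_ref, e_test)

energies_demo = normalized_log_energies([1.0, 10.0, 100.0])
total_demo = shape_plus_energy_distance(0.5, -10.0, -4.0, alpha=0.1)
```

Because the combined value is just another frame distance, it can be passed unchanged to a code-book design or DTW routine that expects a single distance per frame pair.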


2.2 The conventional DTW word recognizer 


Figure 1 shows a block diagram of the conventional DTW word
recognizer based on LPC modeling.’”!° The input speech signal, s(n),
recorded over a standard dialed-up telephone line, is bandpass filtered
between 100 and 3200 Hz, and digitized at a 6.67-kHz rate. The first
step (preprocessing) consists of a first-order digital network, which
provides a high-frequency preemphasis to the speech. The preemphas-
ized signal is blocked into frames of 45 ms (300 samples), with each
consecutive frame spaced 15 ms (100 samples) apart. An eight-pole
LPC analysis (autocorrelation method) is performed on each frame of
the word (after the word has been isolated by means of an endpoint
detector).”° Each resulting LPC vector is either used directly or vector
quantized by means of a code book of size M*. The resulting sequence
of LPC vectors, called the test pattern, is compared with each reference
pattern in the reference template set using a DTW alignment algo-
rithm that simultaneously provides a distance score associated with
the alignment. The distance scores for all the reference patterns are
sent to a decision rule, which provides a classification of the spoken
word, and possibly an ordered (by distance) set of the best 6 candidates.

Fig. 1—Block diagram of conventional DTW word recognizer based on LPC model-
ing.
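
The preemphasis and frame-blocking steps of the front end can be sketched in a few lines of Python. The frame length of 300 samples and the shift of 100 samples come from the text; the preemphasis coefficient of 0.95 and the function names are assumptions, since the text states only that a first-order network is used.

```python
def preemphasize(samples, coeff=0.95):
    """First-order preemphasis y(n) = s(n) - coeff * s(n-1); coeff is assumed."""
    if not samples:
        return []
    return [samples[0]] + [samples[n] - coeff * samples[n - 1]
                           for n in range(1, len(samples))]

def block_into_frames(samples, frame_len=300, shift=100):
    """Overlapping frames of frame_len samples, one starting every shift samples."""
    return [samples[start:start + frame_len]
            for start in range(0, len(samples) - frame_len + 1, shift)]

# A 1000-sample dummy signal yields frames starting at 0, 100, ..., 700.
frames_demo = block_into_frames(list(range(1000)))
emph_demo = preemphasize([1.0, 1.0, 1.0], coeff=0.5)
```

Each frame would then be passed to the LPC analysis; incomplete trailing frames are simply dropped in this sketch.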

The word reference patterns for the recognizer of Fig. 1 are created 
by a training algorithm. For speaker-trained applications, typically a 
single reference pattern is created for each word in the vocabulary 
using a robust training algorithm.”’ For speaker-independent appli- 
cations, a set of Q reference patterns is created for each vocabulary 
word using a clustering procedure.” Typically, about 12 templates 
per word are sufficient for recognizing words from a fairly homogene- 
ous adult population of native American talkers. 

If a VQ code book of M* entries has been designed as in Section 
2.1, one can compute and store the table of M* x M* distances 
between all pairs of code-book entries. In this manner, computation 
of distance between any pair of VQ code-book entries becomes a simple 
table lookup. Hence, if we vector quantize the LPC vectors of a test 
utterance and of all reference patterns, then the computation for 
distances in the DTW matching becomes trivial. In addition, a novel 
technique for reducing quantization distortion when using a VQ was 
proposed by Sakoe.”* For this technique the test vector is not quan- 
tized. Instead, the table of distances between each test vector and all 
code-book entries is computed once and used in the DTW distance 
calculation. In this manner there is reduced distortion, reduced storage 
(over conventional VQ), and essentially no computation for local 
distance calculation (it is still a table lookup procedure). 
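
The table-lookup idea can be sketched as follows in Python, for the case where both the test and the reference patterns are vector quantized. The local path constraints of the recognizer are not given in this section, so the sketch uses the simplest symmetric step pattern (horizontal, vertical, and diagonal moves); that choice, the function names, and the scalar toy code book are assumptions.

```python
def distance_table(codebook, dist):
    """Precompute table[r][t] = dist(codebook[r], codebook[t]) for all pairs."""
    return [[dist(r, t) for t in codebook] for r in codebook]

def dtw_score(ref_idx, test_idx, table):
    """Accumulated DTW distance between two sequences of code-book indices.

    Every local distance is a table lookup. The step pattern is an assumption.
    """
    inf = float("inf")
    n, m = len(ref_idx), len(test_idx)
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = table[ref_idx[i - 1]][test_idx[j - 1]]
            acc[i][j] = local + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m]

# Toy scalar code book with an absolute-difference distance.
table_demo = distance_table([0.0, 1.0, 3.0], lambda r, t: abs(r - t))
score_same = dtw_score([0, 1, 2], [0, 0, 1, 2], table_demo)
score_diff = dtw_score([0, 1, 2], [2, 2, 2], table_demo)
```

The table is indexed as `table[reference][test]` because an LPC distance need not be symmetric in its two arguments.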


2.3 The HMM word recognizer 


Figure 2 is a block diagram of the HMM word recognizer. The front-
end processing, namely preprocessing, frame blocking, LPC analysis,
and vector quantization, is identical to that used in the vector quan-
tized DTW recognizer described above. The test utterance is reduced
to an observation sequence, {O}, consisting of the indices of the code-
book vectors that best match corresponding LPC vectors of the utter-
ance. A Viterbi scoring algorithm determines, for each individual word
HMM, the probability that the observation sequence was generated
by the given word HMM. A decision rule either chooses the word
whose word model has the highest probability as the recognized word,
or gives a list of word candidates ordered by model probability scores.

Fig. 2—Block diagram of HMM word recognizer based on LPC modeling and VQ.

Model probability scores for each word model are computed as
follows. Each word HMM is an N-state model characterized by a state
transition matrix, A, and a model output symbol probability matrix,
B. Figure 3 illustrates an N = 5 state word model with M* discrete
output symbols for each state. Here we assume that the word models
are left-to-right models; i.e., the elements of the transition matrix, $a_{ij}$,
satisfy the relationship

\[
a_{ij} = 0, \quad j < i, \tag{4a}
\]

and, furthermore, we restrict the range of transitions to the case

\[
a_{ij} = 0, \quad j > i + 2. \tag{4b}
\]

That is, we allow transitions between states that are either adjacent
or one apart. Previous experimentation has shown these constraints
are reasonable.”

Based on the above discussion, the scoring procedure for the obser-
vation sequence O = {$O_1$, $O_2$, ..., $O_L$} (i.e., L indices of code-book
entries), given the model M(=A, B), is as follows:


1. Initialization: $\delta_1(1) = \log[b_1(O_1)]$
   $\delta_1(i) = -\infty$, $2 \le i \le N$
2. Recursion: For $2 \le l \le L$, $1 \le j \le N$
   $\delta_l(j) = \max_i \{\delta_{l-1}(i) + \log[a_{ij}]\} + \log[b_j(O_l)]$,
   $\max(1, j - 2) \le i \le \min(j, N)$
3. Termination: $P(M) = \delta_L(N)$.


Fig. 3—Typical five-state Markov model for word with transition matrix
$A = \{a_{ij}\}$ and output symbol matrix $B = \{b_j(k)\}$.


The above algorithm is a form of the well-known dynamic programming
method and can be shown to have the property of determining
the state sequence $i = i_1, i_2, \ldots, i_L$, which maximizes
$P(O, i \mid M)$. It can be seen that if the entries in the matrices A
and B are stored in logarithmic format, i.e., $\log[a_{ij}]$ and
$\log[b_j(k)]$, then the computation of the Viterbi scoring algorithm
requires no multiplications or logarithm computation. Hence, the speed
of computation of the Viterbi scoring is quite high.
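
The scoring procedure above can be sketched in Python as follows. This is a minimal sketch, assuming zero-based state and symbol indices and matrices that are already stored as logarithms; the function and argument names are illustrative and do not come from the paper.

```python
import math

NEG_INF = -math.inf


def viterbi_log_score(obs, log_a, log_b):
    """Log-domain Viterbi score for a left-to-right HMM.

    obs   : list of code-book indices O_1..O_L (zero-based symbols)
    log_a : N x N matrix, log_a[i][j] = log a_ij (-inf where a_ij = 0)
    log_b : N x M matrix, log_b[j][k] = log b_j(k)
    Returns delta_L(N): the best-path log probability for a path that
    starts in the first state and ends in the last state.
    """
    n = len(log_a)
    # Initialization: the path must start in the first state.
    delta = [NEG_INF] * n
    delta[0] = log_b[0][obs[0]]
    # Recursion: only states j-2, j-1, and j can precede state j,
    # so the inner maximization needs additions and comparisons only.
    for symbol in obs[1:]:
        new_delta = []
        for j in range(n):
            best = max(delta[i] + log_a[i][j]
                       for i in range(max(0, j - 2), j + 1))
            new_delta.append(best + log_b[j][symbol])
        delta = new_delta
    # Termination: the path must end in the last state.
    return delta[n - 1]
```

A recognizer in the style described here would evaluate this score once per word model and rank the words by the result.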

The A and B matrices for each word HMM are estimated from a 
training set consisting of a set of observation sequences for the word. 
Starting with an initial estimate of the model, the probability, P, of 
observing the given training sequence O given the model M is
computed. Using the Baum-Welch reestimation algorithm, the model is
iteratively adjusted to increase P. The iterations are stopped when P 
stops increasing significantly or when some other stopping criterion 
is met (e.g., when the number of iterations exceeds a limit). 
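
The stopping rule just described can be sketched as a short Python loop. The reestimation step itself is not shown: `reestimate` and `log_likelihood` are placeholder callables supplied by the caller, and the default limits are arbitrary values chosen for the example, not settings reported in the paper.

```python
def train_until_converged(model, reestimate, log_likelihood,
                          max_iterations=50, min_gain=1e-4):
    """Repeat a reestimation step until P stops increasing significantly.

    reestimate(model)     -> adjusted model (placeholder for Baum-Welch)
    log_likelihood(model) -> log P of the training data given the model
    Stops when the gain in log P falls below min_gain or when
    max_iterations steps have been taken.
    """
    score = log_likelihood(model)
    for _ in range(max_iterations):
        candidate = reestimate(model)
        new_score = log_likelihood(candidate)
        if new_score - score < min_gain:
            break  # P has stopped increasing significantly
        model, score = candidate, new_score
    return model, score
```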


III. RECOGNITION EVALUATION OF THE AIRLINES VOCABULARY


A series of recognition tests was run on both the DTW and HMM 
word recognizers, using the 129-word airlines vocabulary of Table I. A 
training set consisting of one replication of each word by each of 100 
talkers (50 male, 50 female) was used to generate speaker-independent 
word templates for the DTW recognizer, and speaker-independent 
word HMMs for the HMM recognizer. The code-book vectors for the 
VQ were also derived from a subset of the training data. (VQs derived 
from a digits-only vocabulary were also tried and produced essentially 
no degradation in system performance.) All word recordings were 
made over dialed-up local telephone lines, with a new line used for 
each talker. 

An independent test set consisting of one replication of each word 
by each of 20 talkers (10 male, 10 female) was used. None of the test 
talkers had contributed to the training data. Again, all recordings were
made over local, dialed-up telephone lines. 

A series of 14 separate recognition tests was made on the DTW
and HMM recognizers. The runs consisted of the following systems:


Run 1: Conventional DTW recognizer without VQ. 

Runs 2 and 3: Conventional DTW recognizer with VQ applied to 
both test and reference sets. 

Runs 4 through 9: HMM recognizer using a shape VQ with 
different values for N, the number of states, and M*, the size of the 
VQ code book. 

Run 10: HMM recognizer using a shape plus energy VQ. 

Run 11: HMM recognizer using a shape VQ with variable number 
of states per word. 

Run 12: HMM recognizer using a shape VQ with five randomly 
generated models per word. 

Run 13: HMM recognizer using a shape VQ using the average of 
five randomly generated models per word. 

Run 14: HMM recognizer using a shape VQ with both five and 
eight state models for each word. 


Recognition runs 1 through 10 are the standard cases of DTW and 
HMM recognizers, and they effectively provide performance measure- 
ments on the effects of HMM and VQ parameters. Run 11 is used to 
test the hypothesis that long words (in terms of phoneme count) need 
bigger models (more states) than short words. For this run the number 
of states in the Markov model was set equal to the number of phonemes 
in the word, and a lower limit of 4 states and an upper limit of 10 
states were imposed. Runs 12 and 13 examine the effects of generating
multiple models (from different random initial model guesses) for each
word and either using all models in the recognizer (run 12), or the
average model (run 13). Finally, run 14 is used to study the effects of
using multiple models with different numbers of states in each model. 
For this run, models with five and eight states were generated for each 
word. 
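
The rule used in run 11 to size each word model amounts to clamping the phoneme count to a fixed range. A minimal Python sketch follows; the function name and the example phoneme counts are hypothetical.

```python
def states_for_word(phoneme_count, min_states=4, max_states=10):
    """Number of HMM states for a word in the variable-size run:
    one state per phoneme, limited to [min_states, max_states]."""
    return max(min_states, min(max_states, phoneme_count))


# Example phoneme counts (made up for illustration only).
sizes = [states_for_word(n) for n in (2, 7, 13)]
```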


3.1 Recognition run results 


The performance results, given as a series of average word error
rates for the β best word candidates, are shown in Table II and are
partially plotted in Figs. 4 and 5. Table II gives the average word error
rate scores for each of the 14 runs. Figure 4 shows plots of these results
for runs 1, 2, 3, and 10, and Fig. 5 shows plots of the results for runs
4 through 9. An examination of the curves in Fig. 4 shows that the
use of the VQ leads to significantly poorer performance in both the
DTW and the HMM recognizers (at least 6 percent for β = 1 and at
least 0.8 percent for β = 5). However, within the set of recognizers
using a VQ, the HMM recognizer with a shape plus energy VQ achieves
performance comparable to or better than the DTW recognizer with
a shape VQ of the same size.


Table II—Characteristics of recognizer and average word error rate
scores (%) as function of candidate position for individual
recognition tests on airlines vocabulary

                                      Number of Word Candidates
Run  Recognizer   N         M*       1      2      3      4      5
 1   DTW          -         -        9.4    4.0    2.4    1.7    1.4
 2   DTW/VQ       -         64      20.1   10.7    6.0    4.2    3.5
 3   DTW/VQ       -         128     16.4    7.4    4.5    3.6    2.8
 4   HMM/VQ       5         64      27.2   15.6   11.1    8.4    6.7
 5   HMM/VQ       5         128     22.5   12.3    8.2    6.1    4.8
 6   HMM/VQ       8         64      24.0   13.3    8.8    6.6    5.3
 7   HMM/VQ       8         128     20.7   10.3    6.4    4.8    3.9
 8   HMM/VQ       10        128     19.3    9.9    6.2    4.3    3.6
 9   HMM/VQ       10        256     17.6    8.2    5.1    3.8    2.9
10   HMM/VQ.E     10        128     15.5    7.0    4.4    3.1    2.2
11   HMM/VQ       Variable  64      30.5   19.1   13.4    9.9    8.1
12   (HMM/VQ)     5         64      30.4   18.4   11.9    9.1    7.3
13   (HMM/VQ)     5         64      30.4   18.4   11.9    9.1    7.3
14   HMM/VQ       5+8       64      24.3   13.5    8.9    6.7    5.5


Fig. 4—Average word error rate, in percent, versus number of word candidates for
data of runs 1 through 3 and run 10.



Fig. 5—Average word error rate, in percent, versus number of word candidates for
data of runs 4 through 9.


The results in Fig. 5 show that the parameters N and M* of the
HMM (and the VQ) do affect recognizer performance significantly for
the airlines vocabulary. The results shown in this figure indicate that
increasing both N and M* leads to improved word recognition accuracy.
In fact, the average word error rate for the top candidate drops by
about 10 percentage points (from 27.2 to 17.6 percent in Table II)
in going from N = 5, M* = 64 to N = 10, M* = 256.
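
The size of this improvement can be checked directly from the first-candidate column of Table II for runs 4 through 9. The short Python sketch below does the arithmetic; the dictionary layout and function name are choices made for this example.

```python
# First-candidate error rates in percent for runs 4 through 9,
# keyed by (N, M*), copied from Table II.
top_error = {
    (5, 64): 27.2, (5, 128): 22.5,
    (8, 64): 24.0, (8, 128): 20.7,
    (10, 128): 19.3, (10, 256): 17.6,
}


def error_drop(small, large, table=top_error):
    """Return the drop in error rate between two configurations as
    (percentage points, percent of the starting error rate)."""
    absolute = table[small] - table[large]
    relative = 100.0 * absolute / table[small]
    return absolute, relative


absolute, relative = error_drop((5, 64), (10, 256))
```

The absolute difference is the quantity quoted in the text; the relative figure shows how large that change is compared with the starting error rate.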

The results of the special runs (11 through 14) have some interesting 
implications. The results of run 11 (with variable-size Markov models) 
indicate worse performance than for fixed-size models. This result was 
anticipated based on experimentation with simulated HMM examples 
in which a stronger bias was always found for bigger models (more 
states) than for smaller models. The results of runs 12 and 13, in 
which five random starting conditions were used to generate five 
models per word, show a small but consistent improvement in per- 
formance when using all five models in the scoring (run 12) and a 
small but consistent degradation in performance when using the 
average model in the scoring (run 13). A plausible explanation of this 
result is that due to large model variability, averaging often results in 
a poor model because of the outliers. 


Fig. 6—Number of word errors as function of talker for data of runs 1, 3, and 10.


The results of run 14, in which the five- and eight-state models were 
combined, show that the performance in this case is essentially iden- 
tical to the eight-state recognizer alone. Hence, this result again 
supports the observation that a strong bias exists in favor of bigger 
models. 


3.2 Additional analysis of runs 1, 3, and 10 data 


For the data of runs 1, 3, and 10, several additional analyses were 
performed to determine the correlation of individual talker errors with 
the recognizer, to examine the most frequent errors made, and to see 
how similar the errors were in the two recognizers. Figure 6 shows a 
plot of the number of word errors versus talker number for the 
recognition systems of runs 1, 3, and 10. It can be seen that there is a 
high correlation in the individual talker error rates among the three 
systems. It can also be seen that there is a high degree of variability 
in individual talker error rates. 

Table III shows, for the data of runs 1 and 10, the words with the
highest error rates over all talkers, and the words most confused with
these words. The numbers in parentheses indicate the total number
of errors (across the 20 talkers) and the total number of confusions.
For the DTW recognizer the confusions of these 12 words accounted
for one-third of the overall word errors; for the HMM recognizer the
confusions among the 14 worst words accounted for 30 percent of the
overall word errors. For the DTW recognizer most of the major
confusions are between words that sound similar (e.g., “By,” “I,”
“My”); for the HMM recognizer there are many major confusions
between dissimilar words (e.g., “In,” “Evening;” “Miami,” “Morning,”
etc.). It is interesting that among the words most confused by both
systems, there are only three overlapping words, namely “Do,” “Time,”
and “Seats.”


Table III—Major word confusions in data of runs 1 and 10

         DTW Recognizer                         HMM Recognizer
Word         Major Confusions           Word         Major Confusions
By (9)       I (5), My (4)              Many (11)    May (6)
Do (8)       To/Two (4)                 Oh (10)      Of (3), How (2)
Flight (8)   Flights (6), Like (2)      Time (10)    Times (5), Five (4)
May (7)      Make (2)                   Pay (10)     A (5), Take (3)
Oh (7)       How (3), Go (2)            Flight (9)   Flights (6)
Is (6)       In (5)                     Do (8)       To/Two (8)
Home (6)     Oh (2), How (2)            On (8)       Of (3), August (2)
Seat (6)     Seats (3)                  Please (8)   Leave (4)
Time (6)     Times (4)                  Miami (8)    Monday (4), Morning (2)
Some (6)     From (3)                   In (7)       Evening (3), A.M. (2)
A (6)        Take (2)                   Want (7)     What (4), One (2)
Seats (6)    Seat (4)                   Take (7)     Eight (8), Seat (2)
                                        Seats (7)    Seat
                                        Three (7)    Ties (2), Many (2), May (2)

The final check on the data was a comparison of the errors made 
by the DTW recognizer (run 1) and the HMM recognizer (run 10). In 
earlier experimentation with the digits vocabulary, it was found that 
an orthogonality existed between the errors of the DTW and HMM 
recognizers, in that whenever the DTW recognizer made an error, the 
HMM recognizer was almost always correct. We have made the same 
type of orthogonality check on the data of runs 1 and 10. Of the 243 
word errors made by the DTW recognizer, 111 (45 percent) were cases 
in which the HMM recognizer also made an error. Hence, in only 55 
percent of the cases is there any potential for error detection and 
correction. Thus, we conclude that it would not be easy to combine 
the results of the DTW and HMM recognizers, for this vocabulary, to 
greatly improve overall word accuracy. 
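
The orthogonality check can be expressed as a set overlap. In the hedged Python sketch below, each error is represented as a (talker, word) pair; that representation, the function name, and the small example sets are assumptions for illustration, since the paper does not give the per-token error lists.

```python
def error_overlap(errors_a, errors_b):
    """Fraction of recognizer A's errors on which recognizer B also erred.

    Each argument is a set of (talker, word) tokens that were
    misrecognized. A low value means B could help correct A's errors.
    """
    if not errors_a:
        return 0.0
    return len(errors_a.intersection(errors_b)) / len(errors_a)


# Made-up example tokens, not data from the paper.
dtw_errors = {(1, "by"), (1, "do"), (2, "oh"), (3, "is")}
hmm_errors = {(1, "do"), (2, "oh"), (5, "a")}
example_overlap = error_overlap(dtw_errors, hmm_errors)

# The same ratio from the counts quoted in the text:
# 111 shared errors out of 243 DTW errors.
shared_fraction = 111 / 243
```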


IV. SUMMARY 


In this paper we have applied isolated word recognition based on 
probabilistic modeling (HMMs) to a medium-size vocabulary of airline 
terms. We have found that the use of a finite-size VQ leads to a 
significant degradation in performance for both the conventional 
DTW recognizer and the one based on HMMs. We have also found 
that after vector quantization the HMM and DTW recognizers can be 
designed to give essentially identical performance. We have shown
that increasing N, the number of states in the HMM, and/or M*, the 
size of the VQ code book, improves performance of the HMM recog- 
nizer. Hence, one should use the largest models that can be accom- 
modated in a practical implementation. We have also found that using 
a VQ based on both LPC shape and energy gives improved recognition 
performance over a VQ based on LPC shape alone. Hence, the energy 
contour is extremely helpful in recognizing word vocabularies with 
many polysyllabic words. 

The only discouraging finding was that there was a high degree of 
overlap between the word errors made by the DTW and HMM 
recognizers. Hence, there appears to be no simple method for combin- 
ing these two approaches to give greatly improved performance. 


Probabilistic Analysis of Interframe Tie
Requirements for Cross-Connect Systems


By C. L. MONMA* and D. R. SMITH†

Cross-connect system frames provide the capability for remote access and
cross-connection of circuits in digital format. Their usefulness has resulted in 
a rapidly growing number of existing and planned offices with multiple frames. 
A key problem in the planning of these offices is the determination of the 
frame capacity that should be reserved for tying purposes. Ties, or connections 
between frames, are necessary to cross-connect two-point circuits whose 
segments are terminated on different frames. This paper provides the theoret- 
ical basis for the computation of interframe tie requirements. We describe two 
scenarios for the assignment of circuits to facilities, and facilities to frames. 
For each case we derive the mean and variance of the number of tied circuits 
per frame. These quantities can be used to determine the number of ports that 
must be reserved for ties so that circuits requiring ties between frames will 
only be blocked with a prescribed small probability. 


I. INTRODUCTION 


Automated cross-connect systems, engineered to be effectively non- 
blocking, are often used in special service applications. Legs of circuits 
are assigned to transmission facilities that are linked to a cross-connect 
frame via ports. These frames allow remote access to digital circuits 
for testing purposes and provide cross-connection of legs of a circuit 
remotely. Multiframe offices, necessary because of the limited capacity 
of an individual frame, require planning so that sufficient capacity is
reserved for interframe connections. This paper describes a method to 
compute the capacity required for this purpose. 

One such system is the Digital Access and Cross-Connect System 
(DACS). For an introduction to DACS, see Refs. 1 through 7. The 
analysis presented here is applicable to similar cross-connect systems. 

Each frame has a fixed capacity of ports. Currently, many offices 
require more than one frame because of their size, and the need for 
such multiframe offices is growing rapidly. Multiframe offices may be 
configured in two different ways: fully interconnected and tandem (see 
Figs. 1 and 2). 

Legs of two-point circuits that terminate on different frames must 
be cross-connected by ties (special connections between the frames). 
These connections are either made via facilities linked directly be- 
tween the frames in the fully interconnected case, or through a tandem 
frame. Since ties consume additional termination capacity on frames 
(as well as additional hardware), they should be minimized as much 
as possible. On the other hand, it is also important for planners to 
reserve enough tie ports on a frame so that circuit legs are not blocked 
because of lack of ties. 

Legs of a multipoint circuit are connected by means of a multipoint
bridge, which is external to the frames, eliminating the need for ties.
For the remainder of the paper, we assume that all ties are due to
two-point circuits. The analysis is easily extended to treat the case of
multipoint bridges that are inside the frames. When such bridges are
available, ties may be necessary to bring the legs of a multipoint circuit
to the proper frame for bridging.


Fig. 1—Fully interconnected office configuration.

Fig. 2—Tandem office configuration.

This paper describes the theoretical basis for the computation of 
interframe tie requirements for an office for each of two scenarios. 
These scenarios assume different procedures for the assignments of 
circuits to facilities. We have developed a model for an office that 
incorporates these two scenarios as subcases, and we refer to this 
model as the “semicontrol model”. 


1.1 The semicontrol model 


In the semicontrol model, destinations are of two types: control 
destinations, for which the office administrator controls the assign- 
ment of circuit legs to facilities; and no-control destinations, for which 
the assignment of circuit legs to facilities is performed by someone 
else. It is assumed that circuit legs going to a no-control destination 
appear at random on facilities with capacity to that destination. This 
model also assumes that facilities to control destinations are balanced, 
that is, the numbers of facilities to a given destination that are 
terminated on different frames are as equal as is possible (they differ 
by at most one). Each two-point circuit is assumed to connect one 
control and one no-control destination. Multipoint circuits may con- 
nect any combination of destination types. The semicontrol model is 
consistent with networks of separately administered offices, since the 
assignment of a circuit leg between two points cannot be controlled 
by both end offices. 

In this paper we analyze two subcases of the semicontrol model. In
the first, tie requirements are computed for a mature frame subject to 
“churn” (circuits are connected and disconnected with time) without 
the possibility of planned rearrangement. Thus, once a circuit is cross- 
connected by a tie, it remains in this configuration throughout its 
lifetime, even if the possibility of a nontied path becomes available 
later (due to the disconnection of another circuit). In this subcase we 
develop a stochastic model for tied circuits, and solve for the mean 
and variance of the number of ties of the equilibrium distribution. 

In the second subcase of the semicontrol model, we assume simul- 
taneous assignment of circuit legs to the control facilities after obser- 
vation of the assignment of circuit legs to the no-control facilities. 
(We do not use this knowledge in the assignment of facilities to 
frames.) We compute tie requirements immediately after this assign- 
ment—before the effects of churn are felt. This subcase of the model 
is appropriate immediately after the cutover of a new office (when 
knowledge of circuits on facilities is not used in assignment of facilities 
to frames), or immediately after a planned rearrangement of the circuit 
legs to control destinations within an existing office. 

One should note that as time passes, if circuits are not rearranged, 
the total tie requirement will increase from the value computed assum- 
ing the second scenario until it reaches the value computed assuming 
the first scenario of the model. The speed with which this occurs 
depends on the value of the disconnect rate (sometimes referred to as 
churn rate), but the eventual value of the total tie requirement does 
not. 

The present two subcases do not adequately model a third scenario 
involving a growing office. In this scenario, demand increases, and 
new frames and facilities are added from time to time. The policies 
used to select the times for the introduction of new frames and new 
facilities are crucial in determining tie requirements during growth. It 
is conceivable that phased introduction of equipment can lead to tie 
requirements higher than those given by either of the two subcases 
described earlier. This phenomenon has been verified by simulation
work done by P. Soni. Eventual tie requirements for mature offices
without planned rearrangements are still given by the first subcase. 

For both of the subcases, we compute the mean and variance of the 
number of tied circuits (on a frame) associated with each control 
destination. The mean of total tie requirements per frame can be found 
by summing the previously described means over the control destina- 
tions. The variance of total tie requirements per frame can be found 
by summing the previously described variances over the control des- 
tinations, under the assumption that circuits to different control 
destinations on a frame behave independently. The number of ports 
necessary to guarantee that the tie requirement is met with a given
probability is then obtained for the various configurations of frames,
assuming that the total number of tied circuits is normally distributed.
It would also be possible to compute tie requirements by the
equivalent-random method, but the large number of control destinations
typically encountered results in satisfactory accuracy of the normal
approximation.
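
The port calculation described above can be sketched in Python as follows. This is a minimal sketch under the stated assumptions (independent control destinations and a normally distributed total); the function name, argument names, and example numbers are illustrative and are not taken from the paper.

```python
from math import ceil, sqrt
from statistics import NormalDist


def reserved_tie_ports(means, variances, blocking_probability):
    """Ports to reserve on a frame so that the tie requirement exceeds
    the reservation with probability at most blocking_probability.

    means, variances : per-control-destination mean and variance of the
                       number of tied circuits on the frame
    The totals are obtained by summing over destinations, which relies
    on the independence assumption stated in the text.
    """
    total_mean = sum(means)
    total_sd = sqrt(sum(variances))
    # Standard normal quantile for the required probability of success.
    z = NormalDist().inv_cdf(1.0 - blocking_probability)
    return ceil(total_mean + z * total_sd)


# Example with two destinations (made-up moments).
ports = reserved_tie_ports([4.0, 6.0], [9.0, 16.0], 0.01)
```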

The methodology used here assumes that there is a large number of 
frames within the office. Care must be taken when interpreting the 
results for small offices. The calculations described herein will over- 
estimate tie requirements for offices of two or three frames. When 
four or more frames are involved, the tendency to overestimate tie 
requirements is probably minimal. 

The rest of this paper is organized as follows: Section II describes 
and analyzes the subcase of a mature frame affected by churn without 
planned rearrangements, and Section III describes and analyzes the 
subcase of a frame after an initial assignment of circuits. Section IV 
contains concluding remarks. 


II. THE “MATURE” FRAME UNDER CHURN WITHOUT PLANNED
REARRANGEMENTS

Here we consider an individual frame within an office and compute 
the distribution of the number of tied circuits on that frame when the 
churn assumption is appropriate. We assume the previously described 
semicontrol model. That is, there are control and no-control destina- 
tions, circuit legs to no-control destinations appear at random on 
facilities to those destinations, two-point circuits connect one no- 
control and one control destination, and facilities to control destina- 
tions are balanced as much as possible on frames. We also assume 
that there are no planned rearrangements, i.e., once a circuit is cross- 
connected, it remains in that configuration for the duration of its 
lifetime. 

The lifetime of a circuit is assumed to be exponentially distributed. 
We consider a “mature” frame, so that the rate of replacement of 
circuits on a frame is essentially constant and equal to the long-run 
rate of circuit disconnects (the disconnect rate per circuit multiplied 
by the average number of circuits on the frame). The arrivals of 
connect orders having a leg to a given destination are assumed to be 
given by a Poisson process. We assume that the long-run average of 
the proportion of circuits to any given destination is constant, as is 
the proportion of two-point circuits. We also assume that these pro- 
portions are maintained on each of the no-control facilities, independ- 
ent of their destinations. The last assumption will tend to give higher 
values for the computed number of ties because it ignores the
communities of interest that may exist between the no-control and control
destinations.

The entire analysis of this section is concerned with circuits that 
have a leg to a given control destination and appear on a given frame. 
The mean and variance of the number of tied circuits on a frame due 
to these circuits will be determined. The mean (variance) of total tie 
requirements per frame will be found by summing the means (vari- 
ances) of the tie requirements for all destinations. 

We keep track of four types of circuits (see Fig. 3) for the given 
control destination on the given frame: 

1. Two-point circuits cross-connected through the frame—Here the 
two legs of the circuit are assigned to facilities appearing on the same 
frame. Obviously, no ties are used for these circuits. 

2. Two-point circuits tied out of the frame—Here the circuit has 
one leg on a no-control facility terminated on the given frame, but is 
tied to another frame for cross-connection to a facility going to the 
proper (control) destination. One tie circuit is needed on this frame 
for this type of circuit. 

3. Two-point circuits tied into the frame—Here the circuit has one 
leg on a no-control facility terminated on another frame, but is tied to 
the given frame for cross-connection to a facility going to the proper 
(control) destination. One tie circuit is needed on this frame for this 
type of circuit. 

4. Multipoint legs on the frame—Here a cross-connection appears
on the given frame between a circuit going to the multipoint bridge
and a circuit on a facility going to the proper (control) destination.
No ties are needed for circuits of this type, although port capacity is
used to connect these circuits to the bridge.


Fig. 3—Circuit types on a given frame.

We denote by $X_i$ the number of circuits of type i (described above)
to a particular control destination. We show that the evolution of the
state vector $(X_1, X_2, X_3, X_4)$ is given by a continuous-time Markov
chain that is closely related to the evolution of the state vector for a
classical overflow process. The latter has a known solution for the
means and variances of its two random variables, which enables us to
calculate the mean and variance of the equilibrium tie requirements
$(X_2 + X_3)$.

We assume that the total capacity of facilities to the control
destination, which are terminated on the given frame, is k circuits.
Clearly, $X_1 + X_3 + X_4$ is the number of these circuits in use, which
cannot exceed k.

State transitions change one of the components of the state vector 
by one unit (either up or down). These transitions correspond to the 
connecting or disconnecting of a circuit of the type represented by 
that component. 

We consider disconnect transitions first. Without loss of generality,
we scale time in units of the mean lifetime of a circuit, so that the
disconnect rate per circuit is unity. There is a transition corresponding
to the disconnection of a circuit of type i at rate $X_i$ for each of the
four types of circuits, since the lifetimes are assumed to be
exponentially distributed.

Now consider transitions corresponding to the connecting of cir- 
cuits. There are two kinds of these transitions. The first comprises 
those two-point circuits that have one leg connected to a no-control 
destination on the given frame (corresponding to circuits of types 1 
and 2). The second comprises those circuits having a leg elsewhere, 
which we assign to the given frame for cross-connection to the control 
destination (corresponding to circuits of types 3 and 4). 

The first kind corresponds to arrivals of circuit legs on no-control
facilities of the given frame that require cross-connection to a
particular destination. We assume that these occur according to a Poisson
process with rate $\lambda_1$. If there is capacity to the destination
($X_1 + X_3 + X_4 < k$), these circuits are cross-connected through the
frame to avoid ties. In this case each such event corresponds to the
arrival of a type 1 circuit ($X_1$ is incremented by 1). If there is no
capacity to the destination ($X_1 + X_3 + X_4 = k$), these must be tied
out of the frame. In this case each such event corresponds to the
arrival of a type 2 circuit ($X_2$ is incremented by 1).

Now we consider events that cause type 3 circuits to connect. Every 
time a circuit is tied out of any frame in the office, it must be tied into 
another frame with available capacity to the destination. (It would 
never be tied into a frame without capacity.) If the frame to be tied
into is selected without regard to remaining capacity and without 
preference for a given frame, then the rate of such tied-in circuit 
connects should be approximately equal for all frames. Furthermore, 
this rate should be approximately equal for all states on the frame for 
which there is available capacity, at least if there are many frames in 
the office. 

Accordingly, we assume that the connects of type 3 circuits
correspond to a Poisson process with rate $\lambda_2$ (still to be
determined) independent of the state, provided that capacity exists to
the destination. Later in the section, $\lambda_2$ will be determined by
a self-consistency requirement. For the present, we assume it is known.

In a similar fashion, it can be reasoned that requests for type 4
circuit connects occur at rate $\lambda_3$ (still to be determined)
roughly independent of frame and state, provided that capacity to the
destination is available at the frame.

To sum up, we assume that the evolution of the state vector
$(X_1, X_2, X_3, X_4)$ is given by a continuous-time Markov chain with
the following transition rates [all from $(x_1, x_2, x_3, x_4)$]:


Into State                      Rate
$(x_1 - 1, x_2, x_3, x_4)$      $x_1$
$(x_1, x_2 - 1, x_3, x_4)$      $x_2$
$(x_1, x_2, x_3 - 1, x_4)$      $x_3$
$(x_1, x_2, x_3, x_4 - 1)$      $x_4$
$(x_1 + 1, x_2, x_3, x_4)$      $\lambda_1$  (if $x_1 + x_3 + x_4 < k$)
$(x_1, x_2 + 1, x_3, x_4)$      $\lambda_1$  (if $x_1 + x_3 + x_4 = k$)
$(x_1, x_2, x_3 + 1, x_4)$      $\lambda_2$  (if $x_1 + x_3 + x_4 < k$)
$(x_1, x_2, x_3, x_4 + 1)$      $\lambda_3$  (if $x_1 + x_3 + x_4 < k$).


Our goal is to find the mean and variance of the number of ties
$(X_2 + X_3)$ for the ergodic state distribution.
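
The transition rates listed above can be written as a small Python function that enumerates the moves out of a given state. This is a sketch for illustration: the function name and the tuple representation of the state are assumptions, and time is scaled so that each circuit disconnects at rate 1, as in the text.

```python
def transitions(state, k, lam1, lam2, lam3):
    """List (next_state, rate) pairs out of state = (x1, x2, x3, x4).

    k is the capacity (in circuits) to the control destination on this
    frame; lam1, lam2, lam3 are the three connect rates.
    """
    x = list(state)
    moves = []
    # Disconnects: a circuit of type i leaves at rate x_i.
    for i in range(4):
        if x[i] > 0:
            nxt = x.copy()
            nxt[i] -= 1
            moves.append((tuple(nxt), float(x[i])))
    has_capacity = x[0] + x[2] + x[3] < k
    if has_capacity:
        # Arrivals are connected as types 1, 3, and 4.
        arrivals = [(0, lam1), (2, lam2), (3, lam3)]
    else:
        # No capacity: the rate-lam1 arrivals are tied out as type 2.
        arrivals = [(1, lam1)]
    for i, rate in arrivals:
        nxt = x.copy()
        nxt[i] += 1
        moves.append((tuple(nxt), rate))
    return moves
```

Such a function could serve as the core of a simulation used to check the analytical moments derived later in this section.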

If we introduce another component to the state space, having no 
effect on the first four components, the resulting ergodic probabilities 
can be related to the ergodic probabilities of the classical trunk- 
overflow process. Since the means and variances of the state variables 
are readily available for the trunk-overflow process, we are easily able
to solve for the mean and variance of the required number of ties for 
a destination on a frame. 

We thus add a dummy component, $X_5$, and also add the following
transition rates from $(x_1, x_2, x_3, x_4, x_5)$:


Into State                            Rate
$(x_1, x_2, x_3, x_4, x_5 - 1)$       $x_5$
$(x_1, x_2, x_3, x_4, x_5 + 1)$       $\lambda_2 + \lambda_3$  (if $x_1 + x_3 + x_4 = k$).


The other transitions are as before (ignoring the value of $X_5$), with
$X_5$ unchanged by the transitions.

Define $Y_1 = X_1 + X_3 + X_4$, $Y_2 = X_2 + X_5$, and
$\lambda = \lambda_1 + \lambda_2 + \lambda_3$. It is readily seen that
the vector $(Y_1, Y_2)$ is a continuous-time Markov chain with
transition rates from $(y_1, y_2)$:


Into State            Rate
$(y_1 - 1, y_2)$      $y_1$
$(y_1, y_2 - 1)$      $y_2$
$(y_1 + 1, y_2)$      $\lambda$  (if $y_1 < k$)
$(y_1, y_2 + 1)$      $\lambda$  (if $y_1 = k$).


These are the transition rates for the classical trunk-overflow
process. Physically, one may think of offering $\lambda$ Erlangs of
traffic to k primary trunks, with any overflow going to an infinite
group of trunks. The first component represents the number of primary
trunks occupied, and the second component represents the number of
overflow trunks occupied. The moments of the ergodic distribution of
these variables are available:


$$E(Y_1 + Y_2) = \lambda, \tag{1}$$
$$\mathrm{var}(Y_1 + Y_2) = \lambda, \tag{2}$$
$$E(Y_1) = \lambda(1 - B(k, \lambda)), \tag{3}$$
$$E(Y_1^2) = (1 + \lambda)E(Y_1) - k\lambda B(k, \lambda), \tag{4}$$
$$E(Y_2) = \lambda B(k, \lambda), \tag{5}$$
and

$$\mathrm{var}(Y_2) = E(Y_2)\left[1 - E(Y_2) + \frac{\lambda}{k + 1 + E(Y_2) - \lambda}\right], \tag{6}$$

where $B(k, \lambda)$ is the classical Erlang blocking formula

$$B(k, \lambda) = \frac{\lambda^k / k!}{\sum_{j=0}^{k} \lambda^j / j!}. \tag{7}$$
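
The moments of the trunk-overflow process can be evaluated numerically with a few lines of Python. The sketch below computes the Erlang blocking probability with the standard recursion, which avoids large factorials, and then applies the expressions for the primary and overflow groups; the function names and the dictionary keys are choices made for this example.

```python
def erlang_b(k, lam):
    """Erlang blocking probability B(k, lam), computed with the
    recursion B(0) = 1, B(n) = lam*B(n-1) / (n + lam*B(n-1))."""
    b = 1.0
    for n in range(1, k + 1):
        b = lam * b / (n + lam * b)
    return b


def overflow_moments(k, lam):
    """Means and variances of the primary (Y1) and overflow (Y2)
    occupancies for lam Erlangs offered to k primary trunks."""
    b = erlang_b(k, lam)
    e_y1 = lam * (1.0 - b)
    e_y1_sq = (1.0 + lam) * e_y1 - k * lam * b
    e_y2 = lam * b
    var_y1 = e_y1_sq - e_y1 ** 2
    var_y2 = e_y2 * (1.0 - e_y2 + lam / (k + 1 + e_y2 - lam))
    return {"E_Y1": e_y1, "var_Y1": var_y1, "E_Y2": e_y2, "var_Y2": var_y2}


moments = overflow_moments(1, 1.0)
```

For one trunk offered one Erlang, the primary trunk is busy half the time, so its occupancy has mean 0.5 and variance 0.25, which gives a quick sanity check on the formulas.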


We now relate the ergodic distribution of $(X_1, X_2, X_3, X_4, X_5)$ to 
the ergodic distribution of $(Y_1, Y_2)$. Let $q_i = \lambda_i/\lambda$. 
Whenever $Y_1$ is incremented, $X_1$ is incremented with probability $q_1$, 
$X_3$ is incremented with probability $q_2$, and $X_4$ is incremented with 
probability $q_3$. Furthermore, departures from $X_1$, $X_3$, and $X_4$ are 
proportional to the values of these state variables. Therefore, given 
$Y_1$, $(X_1, X_3, X_4)$ is distributed in a multinomial distribution 
corresponding to $Y_1$ trials with probabilities of each type $q_1$, $q_2$, 
and $q_3$, respectively. 

Similar reasoning tells us that given $Y_2$, $(X_2, X_5)$ has a binomial 
distribution corresponding to $Y_2$ trials with probabilities of each type 
$q_1$ and $1 - q_1$, respectively. Also, given $Y_1$ and $Y_2$, 
$(X_1, X_3, X_4)$ and $(X_2, X_5)$ are independent. 

Thus, the ergodic probability of $(x_1, x_2, x_3, x_4, x_5)$, denoted 
$\pi(x_1, x_2, x_3, x_4, x_5)$, is related to the ergodic probability of 
$(y_1, y_2)$, denoted $\pi^*(y_1, y_2)$, in the following fashion: 


$$\pi(x_1, x_2, x_3, x_4, x_5) = \pi^*(x_1 + x_3 + x_4,\; x_2 + x_5)\, \frac{(x_1 + x_3 + x_4)!\,(x_2 + x_5)!}{x_1!\,x_2!\,x_3!\,x_4!\,x_5!}\; q_1^{x_1} q_2^{x_3} q_3^{x_4} q_1^{x_2} (1 - q_1)^{x_5}. \tag{8}$$


Equation (8) may also be checked by direct substitution into the 
balance equations, but we omit the details here. The facts stated 
previously, which allowed us to write equation (8), are sufficient to 
extract moments of the number of circuits tied. We proceed by 
conditioning on the values of $Y_1$ and $Y_2$: 


$$E(X_2 + X_3 \mid Y_1, Y_2) = q_1 Y_2 + q_2 Y_1 \tag{9}$$
and 
$$E((X_2 + X_3)^2 \mid Y_1, Y_2) = q_1^2 Y_2^2 + q_1(1 - q_1) Y_2 + 2 q_1 q_2 Y_1 Y_2 + q_2^2 Y_1^2 + q_2(1 - q_2) Y_1. \tag{10}$$


Taking expectations with respect to $Y_1$ and $Y_2$ in (9) and (10), we 
obtain (after some algebra) 


$$E(X_2 + X_3) = q_1 E(Y_2) + q_2 E(Y_1) \tag{11}$$
and 
$$\operatorname{var}(X_2 + X_3) = q_1 q_2 \operatorname{var}(Y_1 + Y_2) + q_2(q_2 - q_1)\operatorname{var}(Y_1) + q_1(q_1 - q_2)\operatorname{var}(Y_2) + q_2(1 - q_2)E(Y_1) + q_1(1 - q_1)E(Y_2). \tag{12}$$


Equations (11) and (12), when combined with eqs. (1) through (6), 
give the mean and variance of the number of tied circuits to a given 
control destination on a given frame. These expressions depend 
implicitly on $q_1$ and $q_2$, which depend on $\lambda_2$ and $\lambda_3$. 
We will determine $q_1$ and $q_2$ in Section 2.1. 
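
The computation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `erlang_b` and `churn_tie_moments` are hypothetical (they are not taken from the paper or from DACSPET), $q_1$ and $q_2$ are supplied as inputs rather than derived, and eqs. (4), (6), and (12) are used as read from the text here.

```python
def erlang_b(k, lam):
    """Erlang blocking B(k, lam) of eq. (7), via the standard recursion."""
    b = 1.0
    for n in range(1, k + 1):
        b = lam * b / (n + lam * b)
    return b


def churn_tie_moments(k, lam, q1, q2):
    """Mean and variance of X2 + X3 (ties on one frame), eqs. (11)-(12).

    k   : circuits to the destination on the frame
    lam : total offered load (lambda)
    q1, q2 : probabilities q_1 and q_2, assumed already known
    """
    b = erlang_b(k, lam)
    e_y1 = lam * (1.0 - b)                                # eq. (3)
    e_y1_sq = (1.0 + lam) * e_y1 - k * lam * b            # eq. (4)
    var_y1 = e_y1_sq - e_y1 ** 2
    e_y2 = lam * b                                        # eq. (5)
    var_y2 = e_y2 * (1.0 - e_y2 + lam / (k + 1.0 - lam + e_y2))  # eq. (6)
    var_sum = lam                                         # eq. (2)

    mean = q1 * e_y2 + q2 * e_y1                          # eq. (11)
    var = (q1 * q2 * var_sum
           + q2 * (q2 - q1) * var_y1
           + q1 * (q1 - q2) * var_y2
           + q2 * (1.0 - q2) * e_y1
           + q1 * (1.0 - q1) * e_y2)                      # eq. (12)
    return mean, var
```

As a sanity check on the sketch, when k is much larger than $\lambda$ the blocking vanishes, $Y_1$ is essentially Poisson with mean $\lambda$, and both the mean and the variance of the ties reduce to $q_2\lambda$.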


2.1 Determination of $\lambda$, $q_1$, and $q_2$ 


We have so far analyzed a single frame with available capacity of k 
circuits to the particular destination considered. It is possible that not 
all frames in an office have the same number of available circuits to 
the destination (since the total number of facilities may not be a 
multiple of the number of frames), and these considerations affect the 
determination of $\lambda_2$ and $\lambda_3$. We will actually find the 
derived quantities $\lambda$, $q_1$, and $q_2$. Suppose that there are m 
types of frames (in terms of circuits to the destination) represented in 
the office, with available number of circuits to the destination of 
$k_1, k_2, \ldots, k_m$ (m will usually be two). Suppose that the 
proportions of the frames in the office that are of these types are 
$p_1, p_2, \ldots, p_m$, respectively. We have argued previously that the 
values of $\lambda_2$ and $\lambda_3$ are identical for all the frames, 
since these represent the rates of arrival of circuits that are selected 
to be connected to any frame in the office (having capacity) with equal 
probability. 

Denote the expected number of circuit legs to the control destination 
per frame (including two-point and multipoint circuits) by T. (This is 
known.) The (random) number of outgoing circuit legs from a frame 
to the destination is given by $Y_1$. Consistency in the average number 
of legs to the destination per frame demands that the average value of 
the expectation of $Y_1$ equals T, or 


$$\sum_{i=1}^{m} p_i \lambda (1 - B(k_i, \lambda)) = T, \tag{13}$$


where use has been made of (3). The left-hand side of (13) is increasing 
in $\lambda$, is continuous, and takes all values in the range [0, c) for 
$\lambda \ge 0$, where $c = \sum_i p_i k_i$ is the average capacity to the 
destination on a frame basis. Therefore, (13) has a unique solution for 
$\lambda$ as a function of T in the above quoted range. 

We also require, for consistency, that the total number of tied-in 
circuits in the entire office is equal to the number of tied-out circuits. 
Recall that $X_2$ represents tied-out circuits and $X_3$ represents tied-in 
circuits, and their expectations are $q_1 E(Y_2)$ and $q_2 E(Y_1)$, 
respectively, where $E(Y_1)$ and $E(Y_2)$ are given in (3) and (5). This 
consistency requires that the averages of these values over all frames be 
equal, or 


$$q_1 \sum_{i=1}^{m} p_i B(k_i, \lambda) = q_2 \sum_{i=1}^{m} p_i (1 - B(k_i, \lambda)), \tag{14}$$


where a common factor of $\lambda$ has been eliminated. Use of (13) in (14) 
with the identity 


$$q_1 = \frac{\lambda_1}{\lambda} \tag{15}$$


yields 

$$q_2 = \frac{\lambda_1}{T} - \frac{\lambda_1}{\lambda}. \tag{16}$$


2.2 Computation of the mean and variance of total ties per frame 


Equations (14), (15), and (16) allow one to compute the parameters 
q1 and q2 used in eqs. (11) and (12), which give the mean and variance 
of the total number of tied circuits on a particular frame. Equations 
(11) and (12) depend on the value of k, the number of circuits to the 
destination on the frame. When the value of k to a particular 
destination is not constant over all frames, averaging over the values is 
necessary. Although the rationale for averaging the mean number of 
ties is clear, some explanation is necessary for the averaging of the 
variance (as opposed to taking the average of the second moment 
minus the square of the average first moment). 

The ultimate goal of this process is to obtain a number representing 
the reserved capacity for tied circuits on a frame, where reserved 
capacity is sufficient to meet tie requirements with a given (user- 
specified) probability. It can be argued that if the number of control 
destinations is large, circuits to each of the destinations behave roughly 
independently. Now choose a particular frame, which has $\ell_i$ circuits to 
control destination i, i = 1, 2, ..., n, where n is the total number of 
control destinations. If n is sufficiently large, the total tie requirements 
for this frame will be approximately normally distributed. The mean 
is equal to the sum of the means calculated for each destination (given 
the number of circuits to that destination on the frame). The variance 
is equal to the sum of the variances calculated for each destination 
(given the number of circuits to that destination on the frame). 

Although the vector $(\ell_1, \ell_2, \ldots, \ell_n)$ will vary from frame to 
frame, the components are definitely not independent, since the total 
outgoing capacity from a frame will be more or less fixed. In other words, a 
frame having a greater than average number of facilities to one 
destination will tend to have lower than average numbers of facilities 
to other destinations. 

If all destinations have identical characteristics (in terms of the 
number of frames with given number of circuits to the destination), 
then the proportion of the $\ell_i$ on a given frame that are equal to a 
particular value will be very close to the overall proportion (over all 
frames) of the $\ell_i$ 
that are equal to that value. In this case, the total variance on the 
frame will be nearly equal to the sum of the average variances (con- 
ditioning on the number of circuits) to each destination. 

If all destinations are not identical, there will still be a tendency for 
the variance of tied circuits to be close to the sum of the average 
variances over the control destinations. 

For this reason it is reasonable to compute the variance of the total 
number of ties per frame as the sum of the average variance per 
destination. The latter is equal to the sum of the product of $p_i$ with 
the value of (12) when $k_i$ replaces k. 


2.3 The mean ties for a destination per frame 


We obtain a simple expression for the average number of ties per 
frame. 
Equations (11), (3), and (5) give 


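$$E(X_2 + X_3) = q_1 \lambda B(k, \lambda) + q_2 \lambda (1 - B(k, \lambda)). \tag{17}$$


The average number of ties per frame, which we denote v, is thus 

$$v = \sum_{i=1}^{m} p_i \left[ q_1 \lambda B(k_i, \lambda) + q_2 \lambda (1 - B(k_i, \lambda)) \right]. \tag{18}$$


Use of (13), (15), and (16) gives 

$$v = 2\lambda_1 (1 - T/\lambda). \tag{19}$$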


Note that $\lambda_1$ is the expected number of two-point circuits per frame, 
so that $2\lambda_1$ is the total ties if all circuits are tied. (Each tied 
circuit results in one tie in and one tie out.) The factor 
$(1 - T/\lambda)$ is the proportion of two-point circuits that are tied, 
where $\lambda$ is determined by the implicit solution to (13). As T 
increases, it can be shown that $1 - T/\lambda$ increases. As T approaches c 
[facility fill goes to one, see definition following (13)], 
$1 - T/\lambda$ goes to 1. 
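
The step from (18) to (19) can be spelled out as follows. This is a sketch that assumes only (13), (14), the definition $q_1 = \lambda_1/\lambda$, and the fact that the proportions $p_i$ sum to one.

```latex
\begin{align*}
\sum_{i=1}^{m} p_i \lambda B(k_i,\lambda)
  &= \lambda - \sum_{i=1}^{m} p_i \lambda \bigl(1 - B(k_i,\lambda)\bigr)
   = \lambda - T
   && \text{by (13)} \\
q_1 (\lambda - T) &= q_2 T
   && \text{(14) multiplied by } \lambda \\
v &= q_1(\lambda - T) + q_2 T
   = 2 q_1 (\lambda - T)
   = 2\lambda_1 \left(1 - \frac{T}{\lambda}\right)
   && \text{from (18)}
\end{align*}
```

The two terms of (18) are therefore equal on average, which is the statement that ties in balance ties out over the office.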

As stated earlier, the mean total number of ties required per frame 
can be found by summing expected ties per destination as given by 
(19) over all destinations. 


2.4 Graphical representation of mean tie requirements 


It is possible to summarize the mean number of ties needed for a 
specific destination per frame in several families of graphs, each of 
which corresponds to a type of facility. These graphs give a value of v 
from (19) when $\lambda$ has been determined from (13) based on four inputs. 
These are 


j = number of circuits per facility (typically j = 12 or 24), 

f = average fill on these facilities, 

a = average number of facilities (to the destination) terminated per 
frame, and 

r = proportion of circuit legs to that destination that are from two- 
point circuits. 

Figure 4 illustrates one set of these graphs, corresponding to analog 
group facilities (j = 12). The average number of tied circuits per frame 
is plotted versus the average number of facilities per frame (a) for fills 
(f) of 0.70, 0.75, 0.80, 0.85, 0.90, and 0.95. These curves assume that 
all circuits are two-point (r = 1). If this is not the case, expected ties 
obtained from the graph should be multiplied by the value of r. The 
“scalloped” nature of the curves is due to the relatively higher mean 
tie requirement when the mean number of facilities per frame is not 
integer, forcing uneven spreading of these facilities on the frames. 

Fig. 4. Ties versus facilities for analog groups; churn model (12 circuits per facility). Vertical axis: total ties per destination; horizontal axis: facilities per frame. 

We now describe the method used to obtain the curves. If facilities 
are spread as evenly as possible, then m = 2, and 


$$T = afj, \quad p_1 = \lceil a \rceil - a, \quad p_2 = 1 - p_1, \quad k_1 = (\lceil a \rceil - 1)j, \quad \text{and} \quad k_2 = \lceil a \rceil j. \tag{20}$$


Here $\lceil x \rceil$ represents the smallest integer greater than or equal 
to x. We can now determine $\lambda$ by numerical solution of (13) and v, the 
average number of ties, can be obtained from (19). 
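
A minimal sketch of this curve computation follows. It is not the DACSPET code: the function names are hypothetical, bisection is just one convenient way to solve (13) numerically, and the line setting $\lambda_1 = rT$ is an assumption made here (it reproduces the stated rule that ties scale with r, and gives $\lambda_1 = T$ when all circuits are two-point).

```python
import math


def erlang_b(k, lam):
    """Erlang blocking B(k, lam), standard recursion."""
    b = 1.0
    for n in range(1, k + 1):
        b = lam * b / (n + lam * b)
    return b


def frame_mix(a, j):
    """Eq. (20): (proportion, capacity) pairs for evenly spread facilities."""
    hi = math.ceil(a)
    p1 = hi - a
    return [(p1, (hi - 1) * j), (1.0 - p1, hi * j)]


def carried_legs(lam, mix):
    """Left-hand side of eq. (13)."""
    return sum(p * lam * (1.0 - erlang_b(k, lam)) for p, k in mix)


def solve_lambda(T, mix, tol=1e-10):
    """Solve eq. (13) for lambda by bisection; needs 0 < T < sum(p_i * k_i)."""
    lo, hi = T, 2.0 * T          # the carried load at lambda = T is below T
    while carried_legs(hi, mix) < T:
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if carried_legs(mid, mix) < T:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def mean_ties(a, f, j, r=1.0):
    """Average ties per frame for one destination, eq. (19)."""
    T = a * f * j
    lam = solve_lambda(T, frame_mix(a, j))
    lam1 = r * T                 # ASSUMPTION: two-point load per frame is r * T
    return 2.0 * lam1 * (1.0 - T / lam)
```

For example, `mean_ties(1.5, 0.80, 12)` evaluates one point of the kind plotted in Fig. 4 (1.5 facilities per frame at 80 percent fill); sweeping `a` for each fill would trace a family of curves under the assumptions above.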


III. SIMULTANEOUS ASSIGNMENT OF CIRCUITS 


In this section we consider the second subcase of the semicontrol 
model in which circuits (and ties) are assigned to control facilities 
after assignment of all facilities to frames and after observation of the 
assignment of the no-control legs of the circuits. This scenario excludes 
the possibility that circuits will be both tied in and tied out to the 
same destination on the same frame. Here again (as in the last section), 
there are two types of ties on a frame—those due to circuits tied out 
and those due to circuits tied in. 

We first obtain the distribution of the number of circuits tied out 
of a frame. By assumption, we know that the distribution of the 
number of no-control legs of two-point circuits on a frame that require 
cross-connection to a particular control destination is a Poisson dis- 
tribution. Ties out are required if this number exceeds the available 
number of circuits to the destination on the frame. In this case the 
number required is exactly equal to the excess. 

The distribution of the number of tied-in circuits on a frame depends 
on the methodology used to assign tied-in circuits. The total number 
of tied-in circuits in an office is equal to the number of tied-out circuits 
in the same office (this can be determined by the method described in 
the last paragraph). The specific way that individual frames are 
assigned these tied-in circuits is up to the office administrator. Of 
course, the number assigned to any particular frame cannot exceed 
the remaining capacity (after satisfying through-connects on the frame 
to the destination). Thus, circuits will never be tied into and tied out 
of the same frame (unlike the churn model of Section II). 

Here we compute distributions of tied-in circuits under each of two 
assignment policies. In the first, tied-in circuits are assigned to mini- 
mize the variance of ties on a frame. In the second, tied-in circuits are 
assigned to maximize the variance of tied circuits on a frame. The 
resulting variances provide lower and upper bounds for the variance 
of any assignment policy. 

We obtain first and second moments for the number of circuits tied 
in and tied out (for a given control destination) per frame. The second 
(first) moment of the total number of ties per destination per frame 
equals the sum of the second (first) moments of the number of circuits 
tied in and the number of circuits tied out. The second moment is 
additive since circuits are never tied into and tied out of the same 
frame in the simultaneous assignment case. (This can, however, occur 
in the churn model of the previous section.) 


3.1 Moments of circuits tied out of a frame 


Let X be the number of legs of two-point circuits incident on a 
frame that must be cross-connected to a given control destination, let 
k be the capacity in circuits to the controlled destination on a frame, 
let Y be the number of circuits tied out to this destination on the 
frame, and let Z be the remaining capacity available for tied-in circuits 
(or multipoints) on the frame. Then, 

$$Y = (X - k)^+, \tag{21}$$
and 
$$Z = (k - X)^+, \tag{22}$$


where $a^+ = \max[a, 0]$. 
Under the assumption that X is a Poisson random variable with 
mean $\lambda$, i.e., 


$$P\{X = i\} = (\lambda^i/i!)\,e^{-\lambda}, \tag{23}$$
we can compute the first two moments of Y: 
$$E(Y) = \lambda S_k - k S_{k+1}, \tag{24}$$
and 
$$E(Y^2) = \lambda^2 S_{k-1} + (1 - 2k)\lambda S_k + k^2 S_{k+1}, \tag{25}$$
where 
$$S_k = \sum_{j=k}^{\infty} (\lambda^j/j!)\,e^{-\lambda}. \tag{26}$$


That is, $S_k$ is the tail sum of the Poisson probability distribution. 
A good approximation for $S_k$ for large values of k is given by the 
normal approximation 


$$S_k \approx \Phi\left(\frac{k - \lambda}{\sqrt{\lambda}}\right), \tag{27}$$


where $\Phi$ represents the complement of the standard normal c.d.f.; that 
is, 


$$\Phi(y) = \int_y^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}\, dx. \tag{28}$$


When the value of k varies, i.e., a proportion $p_i$ of the frames has $k_i$ 
circuits to the destination for i = 1, 2, ..., m, then the appropriate 
average moments are found by taking the expectations of (24) and 
(25) with respect to the distribution p. 
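
The tie-out moments can be evaluated directly from the Poisson tail sums. The sketch below is illustrative only: the function names are hypothetical, (24) and (25) are used in the form $E(Y) = \lambda S_k - kS_{k+1}$ and $E(Y^2) = \lambda^2 S_{k-1} + (1-2k)\lambda S_k + k^2 S_{k+1}$, and the tail sums are computed exactly rather than by the normal approximation.

```python
import math


def poisson_pmf(n, lam):
    """P{X = n} for X ~ Poisson(lam), computed in log space for stability."""
    return math.exp(-lam + n * math.log(lam) - math.lgamma(n + 1))


def poisson_tail(k, lam):
    """S_k = P{X >= k}, eq. (26); equals 1 when k <= 0."""
    return 1.0 - sum(poisson_pmf(n, lam) for n in range(k))


def tie_out_moments(lam, k):
    """First two moments of Y = max(X - k, 0), eqs. (24) and (25)."""
    s_km1 = poisson_tail(k - 1, lam)
    s_k = poisson_tail(k, lam)
    s_kp1 = poisson_tail(k + 1, lam)
    mean = lam * s_k - k * s_kp1
    second = lam ** 2 * s_km1 + (1 - 2 * k) * lam * s_k + k ** 2 * s_kp1
    return mean, second


def averaged_tie_out_moments(lam, frame_types):
    """Average (24) and (25) over frame types given as (p_i, k_i) pairs."""
    moments = [(p, tie_out_moments(lam, k)) for p, k in frame_types]
    return (sum(p * m[0] for p, m in moments),
            sum(p * m[1] for p, m in moments))
```

Because the closed forms only rearrange the defining sums, they can be checked against a direct summation of $(n - k)$ and $(n - k)^2$ over the Poisson probabilities.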


3.2 Moments of the number of tied-in circuits 


We first find the distribution of the available capacity to the 
destination after the assignment of circuits cross-connected through 
the frame, but before circuits are tied in (the random variable Z). We 
obtain 


$$P\{Z = i\} = \frac{\lambda^{k-i}}{(k - i)!}\, e^{-\lambda}, \quad 0 < i \le k, \tag{29}$$
$$P\{Z = i\} = 0, \quad i > k, \tag{30}$$
and 

$$P\{Z = 0\} = 1 - \sum_{i=1}^{k} P\{Z = i\}. \tag{31}$$


Equations (29) through (31) are based on a given frame with k 
available circuits to the destination. Let $a_i$ be the overall probability 
(considering all frames with possibly different values of k) that there 
is a remaining capacity of i circuits to the destination after the 
assignment of circuits cross-connected through the frame, but before 
circuits are tied in. We can compute $a_i$, i > 0, by conditioning on the 
value of k: 


$$a_i = e^{-\lambda} \sum_{j:\, k_j \ge i} p_j\, \frac{\lambda^{k_j - i}}{(k_j - i)!}, \tag{32}$$


where $p_j$ is the proportion of frames in the office with capacity $k_j$ 
circuits to the control destination under consideration. The maximum 
subscript i for which $a_i > 0$ is $\max_j \{k_j\}$. 

We next compute the average number of circuits that must be tied 
in per frame, which we denote by B. This must equal the average 
number of circuits tied out per frame. Using (24) for various k values 
yields 


$$B = \sum_{j=1}^{m} p_j \left(\lambda S_{k_j} - k_j S_{k_j + 1}\right). \tag{33}$$


Now assume that there is a large number of frames. It is possible to 
have a proportion $\theta_i$ of frames with i tied-in circuits to the 
destination under consideration, $\forall i$, if and only if 


$$\sum_{i=1}^{\infty} i\,\theta_i = B \tag{34}$$
and 
$$\sum_{i=j}^{\infty} \theta_i \le \sum_{i=j}^{\infty} a_i, \quad \forall j \ge 0. \tag{35}$$


Equation (34) follows from the fact that the expected number of 
circuits tied in equals the expected number tied out. Equation (35) 
states that the proportion of frames with i or more circuits tied in has 
to be no greater than the proportion of frames with capacity for i or 
more tied-in circuits. 

We now determine the values of $\theta_i$ that give minimum and maximum 
values for the second moment (denoted by V) subject to the constraints 
(34) and (35), where 

$$V = \sum_{i=1}^{\infty} i^2\,\theta_i. \tag{36}$$


3.3 Minimum second moment 


Choose $\ell$ such that 

$$\sum_{i=1}^{\ell} i a_i + \sum_{i=\ell+1}^{\infty} \ell a_i < B \le \sum_{i=1}^{\ell+1} i a_i + \sum_{i=\ell+2}^{\infty} (\ell + 1) a_i. \tag{37}$$


If $B < \sum_{i=1}^{\infty} i a_i$, i.e., if there is average capacity to 
handle tied-in requirements, then this is always possible. This follows 
because the left-hand side of (37) is a nondecreasing function of $\ell$, 
with a value of 0 for $\ell = 0$, and a value of 
$\sum_{i=1}^{\infty} i a_i$ for $\ell = \infty$. Note that $\ell$ is uniquely 
determined. Define 


$$\rho = B - \sum_{i=1}^{\ell} i a_i - \sum_{i=\ell+1}^{\infty} \ell a_i. \tag{38}$$


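Now, the minimal value of V is produced by setting 


$$\theta_i = a_i, \quad i < \ell; \tag{39}$$
$$\theta_\ell = \sum_{i=\ell}^{\infty} a_i - \rho; \tag{40}$$
$$\theta_{\ell+1} = \rho; \tag{41}$$
and 
$$\theta_i = 0, \quad i > \ell + 1. \tag{42}$$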


Physically, this distribution is obtained when one assigns tied-in 
circuits sequentially in the following fashion: Assign the next tied-in 
circuit to a frame with available capacity having the fewest tied-in 
circuits already assigned. 

Clearly, $\theta_i$ forms a valid probability distribution, since (37) implies 
that $\theta_\ell$ and $\theta_{\ell+1}$ are nonnegative, and also since the 
sum of the $\theta_i$'s is one. This distribution is also easily seen to 
satisfy (34) and (35). As a matter of fact, (35) is tight for all 
$j \le \ell$. 

This last property uniquely specifies $\theta$, i.e., a probability 
distribution $\theta$ satisfying (34) and (35) with the property that, for 
every m, if 


$$\theta_{m+1} > 0, \quad \text{then} \quad \sum_{i \ge j} \theta_i = \sum_{i \ge j} a_i, \quad \forall j \le m, \tag{43}$$


must be the distribution $\theta$ given in (39) through (42). 


Proof: Choose $n = \sup\{m : \theta_{m+1} > 0\}$. The condition of the last 
paragraph tells us that $\theta_j = a_j$, $\forall j < n$. The normalization 
of $\theta$ and its satisfying (34) uniquely determine $\theta_n$ and 
$\theta_{n+1}$. Also, 

$$B = \sum_{i=1}^{n+1} i\,\theta_i = \sum_{i=1}^{n-1} i a_i + n\theta_n + (n + 1)\theta_{n+1}.$$


However, 
$$n\theta_n + (n + 1)\theta_{n+1} > n \sum_{i=n}^{\infty} a_i,$$
and 

$$n\theta_n + (n + 1)\theta_{n+1} \le n \sum_{i=n}^{\infty} a_i + \sum_{i=n+1}^{\infty} a_i = n a_n + \sum_{i=n+1}^{\infty} (n + 1) a_i.$$

Comparison with (37) shows that $n = \ell$ and we are done. □ 

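We use this fact to show that the distribution $\theta$ has minimum second 
moment over all distributions satisfying (34) and (35). Choose 
$\theta' \ne \theta$, which satisfies the constraints; $\theta'$ must violate 
(43). Thus, we can find m and $j \le m$ with $\theta'_{m+1} > 0$ and 
$\sum_{i=j}^{\infty} \theta'_i < \sum_{i=j}^{\infty} a_i$. Choose the 
largest such m. Since both $\theta'$ and a sum to one, we can find k < j with 
$\theta'_k > 0$. Choose the largest such k. The new distribution 
$\theta''$ with 


$$\theta''_k = \theta'_k - \frac{\epsilon}{j - k},$$
$$\theta''_j = \theta'_j + \frac{\epsilon}{j - k},$$
$$\theta''_m = \theta'_m + \epsilon,$$
and 
$$\theta''_{m+1} = \theta'_{m+1} - \epsilon,$$


with $\epsilon = \min\left[\theta'_{m+1},\ (j - k)\theta'_k,\ (j - k)\sum_{i=j}^{\infty}(a_i - \theta'_i)\right] > 0$, 
and other components identical to $\theta'$, is still a probability 
distribution satisfying (34) and (35) and 


$$\sum_{i=1}^{\infty} i^2\,\theta''_i = \sum_{i=1}^{\infty} i^2\,\theta'_i + (j + k - 2m - 1)\epsilon,$$


or 


$$\sum_{i=1}^{\infty} i^2\,\theta''_i < \sum_{i=1}^{\infty} i^2\,\theta'_i.$$


With a series of such transformations, $\theta'$ can be transformed into 
$\theta$. □ 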


3.4 Maximum second moment 


We proceed in a similar fashion. In this case choose $\ell$ such that 

$$\sum_{i=\ell+1}^{\infty} i a_i < B \le \sum_{i=\ell}^{\infty} i a_i. \tag{44}$$


Define 


$$\rho^* = B - \sum_{i=\ell+1}^{\infty} i a_i. \tag{45}$$


The maximum value of V is produced by setting 


$$\theta_i = 0, \quad 0 < i < \ell; \tag{46}$$
$$\theta_\ell = \rho^*/\ell; \tag{47}$$
$$\theta_i = a_i, \quad i > \ell; \tag{48}$$
and 
$$\theta_0 = 1 - \sum_{i=1}^{\infty} \theta_i. \tag{49}$$


Physically, this distribution is obtained when one assigns tied-in 
circuits in the following sequential fashion: Completely fill the frame 
with maximum available capacity before assigning to another frame. 
Here we omit the proof, which is similar to that of the last subsection. 
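
The two bounding policies of Sections 3.3 and 3.4 can be sketched as follows. This is a hedged illustration, not the paper's program: all function names are hypothetical, the inequalities in (37) and (44) are taken as strict on the left and non-strict on the right, and a small tolerance `eps` is added here purely to absorb floating-point rounding. The sketch assumes $0 < B \le \sum_i i a_i$.

```python
import math


def poisson_pmf(n, lam):
    """P{X = n} for X ~ Poisson(lam), computed in log space."""
    return math.exp(-lam + n * math.log(lam) - math.lgamma(n + 1))


def poisson_tail(k, lam):
    """S_k = P{X >= k}, eq. (26)."""
    return 1.0 - sum(poisson_pmf(n, lam) for n in range(k))


def spare_capacity_dist(lam, frame_types):
    """a_i of eq. (32); frame_types is a list of (p_j, k_j) pairs."""
    k_max = max(k for _, k in frame_types)
    a = [0.0] * (k_max + 2)      # extra zero slot so index l + 1 always exists
    for i in range(1, k_max + 1):
        a[i] = sum(p * poisson_pmf(k - i, lam) for p, k in frame_types if k >= i)
    a[0] = 1.0 - sum(a[1:])
    return a


def mean_tied_in(lam, frame_types):
    """B of eq. (33): average circuits tied out (hence tied in) per frame."""
    return sum(p * (lam * poisson_tail(k, lam) - k * poisson_tail(k + 1, lam))
               for p, k in frame_types)


def min_moment_policy(a, B, eps=1e-12):
    """Eqs. (37)-(42): spread tied-in circuits as evenly as possible."""
    n = len(a)

    def capped(l):               # left-hand side of (37): sum_i min(i, l) a_i
        return sum(min(i, l) * a[i] for i in range(n))

    l = next(l for l in range(n - 1) if capped(l) < B <= capped(l + 1) + eps)
    rho = B - capped(l)          # eq. (38)
    theta = [0.0] * n
    theta[:l] = a[:l]            # eq. (39)
    theta[l] = sum(a[l:]) - rho  # eq. (40)
    theta[l + 1] = rho           # eq. (41)
    return theta


def max_moment_policy(a, B, eps=1e-12):
    """Eqs. (44)-(49): fill the frames with most spare capacity first."""
    n = len(a)

    def upper(l):                # sum over i >= l of i * a_i
        return sum(i * a[i] for i in range(l, n))

    l = next(l for l in range(n - 1, 0, -1) if upper(l + 1) < B <= upper(l) + eps)
    theta = [0.0] * n
    theta[l] = (B - upper(l + 1)) / l    # eqs. (45) and (47)
    theta[l + 1:] = a[l + 1:]            # eq. (48)
    theta[0] = 1.0 - sum(theta[1:])      # eq. (49)
    return theta


def second_moment(theta):
    """V of eq. (36)."""
    return sum(i * i * t for i, t in enumerate(theta))
```

For a single frame type with k = 12 and $\lambda = 10$, calling `min_moment_policy` and `max_moment_policy` on the output of `spare_capacity_dist` and `mean_tied_in` gives two distributions with the same mean B; their second moments bracket that of any other admissible assignment policy, as argued above.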


IV. CONCLUDING REMARKS 


This paper describes the assumptions of a semicontrol model for 
facility and circuit assignment in a multiframe office. We compute the 
mean and variance of the number of ties per frame for each of two 
subcases. The first scenario assumes that we treat a mature frame in 
a churn environment without planned rearrangements. The second 
one assumes a simultaneous assignment of circuits, and is appropriate 
following the cutover of a new office or a wholesale rearrangement of 
existing circuits within an office. These computations are incorporated 
as subroutines in the computer program DACSPET (DACS Planning 
and Engineering Tool), which computes frame requirements and re- 
sulting port usages for a multiframe office. 


4.1 Preliminary insights 


The previously described computer program has been run on an 
assortment of sample data. These runs provide preliminary insights 
into the question of tie requirements for multiframe offices. We 
summarize these insights below. 

1. The effect of fill is clear in all cases. Higher facility fills imply 
more ties per frame. The reason for this is simply that higher fills 
imply less spare capacity on fewer facilities, and therefore less 
flexibility in assigning circuits. 




2. Tie requirements under the churn scenario are larger than under 
the simultaneous circuit assignment scenario. The effect is relatively 
minor at low facility fills, although it increases rapidly as the fill does. 

3. The size of an office, in terms of number of frames or circuits, 
seems to have less of an effect on tie requirements per frame than 
other variables, such as the number of control destinations served and 
the proportion of traffic to each destination. An office with many 
destinations, each with a small proportion of traffic, will clearly incur 
more tied circuits than an office with a few large destinations under 
the semicontrol model. 

4. The ratio of two-point and multipoint circuits also affects the 
number of tied circuits per frame under the assumption of an external 
multipoint bridge. For the same overall facility fill, the number of tied 
circuits is roughly proportional to the percentage of two-point circuits, 
all other factors being equal. 

This paper describes a technique for assigning orders against specific reels 
of inventoried cable to ensure a high level of customer service, efficient labor 
utilization, highly usable cable lengths in inventory, and minimal scrap. Some 
orders are interrelated and thus linked, requiring that the assignment of each 
order be contingent upon the assignment of all linked orders. Due to this 
contingency, penalties associated with certain assignments, and selection 
allowances, the problem is quite complex. Our method of solution is formulated 
as a preemptive, multipriority, zero-one goal programming model used in a 
constructive initial placement algorithm. The model is primarily linear, but it 
also contains some nonlinear terms. The goal programming model and initial 
constructive algorithm run weekly in a production environment that handles 
up to 100 orders and 150 reels per problem. 


I. THE PROBLEM 


In 1981 Western Electric began offering 26 types of PULP Cable 
(Paper Insulated Telephone Exchange Cable) on a short-interval stock 
basis. Instead of making the cable to order as before, Western Electric 
would fill customer orders from reels of cable manufactured and 
inventoried at the stocking locations for weekly shipment. A method 
was required for selecting the most suitable reel of cable for filling 
each order while ensuring a high level of customer service, efficient 
labor utilization, very usable cable lengths in inventory, and minimal 
scrap. 

Due to the manner in which this cable is installed at the job site, 
three specific selection requirements exist. First, only one reel of cable 
may be used to fill each order because the orders are for specific 
lengths. Henceforth, because of this length requirement, we will refer 
to the orders as subitems. Second, shipping less than the amount 
ordered is forbidden and only 10 feet of tolerance is allowed in the 
amount that can be shipped over the quantity ordered. This corporate 
shipping policy fortunately provides leeway in the selection process. 
Finally, in most instances, there exist several subitems for the same 
cable type, which must be installed at the same time. Subitems of this 
nature will be referred to as linked subitems in this paper. To this 
end, it is imperative that if any linked subitem is assigned from 
inventoried cable, then all subitems to which it is linked must also be 
assigned to some cable. Also, all linked subitems must be filled from 
the same stocking location. To accommodate the reader, each subitem/ 
reel of cable possibility will be termed a real variable. 

Inventoried cable is cable that has been completely tested and found 
to meet predetermined electrical requirements. Each time a piece of 
cable is cut, the newly cut ends must be tested. A 3-foot section of 
cable is lost in testing each end. This lost footage, henceforth referred 
to as the cutting allowance, must be considered in the selection process. 

Because of the subitem linkage, corporate shipping policy tolerance, 
and cutting allowance considerations, the method of solution required 
to solve this assignment problem is quite complex. Since each subitem 
must be completely filled from a single reel of cable, or left unassigned, 
a method was sought to provide optimal or near-optimal results in 
zero-one form, where a value of 1 indicates that the subitem is assigned 
to the reel of cable, and 0 indicates that the subitem is not assigned 
to the reel of cable. The problem then is to minimize: 


$$Z = \sum_{k} \sum_{l} P_{kl}\,[g(d_k^-, d_k^+)],$$

where the sums run over all constraints k and all priority levels l, 


subject to: 


$$U \sum (c_{ij} x_{ij} + d_k^- - d_k^+) - W \sum (c_{ij} x_{ij} + d_k^- - d_k^+) + Y \sum \sum (c_{ij} x_{ij} + d_k^- - d_k^+) \le 0,$$

where 
$P_{kl}$ = preemptive priority l of constraint k 
$g(d_k^-, d_k^+)$ = mathematical operation on slack variables $(d_k^-, d_k^+)$ to be 
minimized 
$c_{ij}$ = various constraint coefficients on real variables $x_{ij}$ 
$x_{ij}$ = 1 if subitem j is assigned to reel i, 0 otherwise 
U, W, Y = 0, 1 based on whether that term is needed in the constraint 
type (see Appendix A for specific formulation of constraints). 


II. HEURISTIC APPROACHES 


Zero-one assignment problems traditionally are solved using several 
techniques, including exhaustive search, branch-and-bound enumera- 
tion, and integer linear programming adaptations.’ These techniques 
become unusable, however, as the problem grows to large sizes. Since 
our problems sometimes consist of 1,000 objectives and 1,500 decision 
variables (for 50 subitems and 150 reels of cable), none of the tradi- 
tionally small-to-medium solution techniques proved suitable when 
investigated. 

Liggett¹ describes two types of heuristic approaches, which will lead 
to good if not optimal results with small-to-medium size problems. 
The first is the iterative improvement approach, which consists of 
obtaining an initial suboptimal solution by any of several methods and 
improving this solution in incremental steps. This method is charac- 
terized by its quick generation of an initial suboptimal solution with 
a subsequent step-by-step improvement, which is generally quite time- 
consuming. This approach has been used with a simplex goal program- 
ming algorithm with subsequent heuristic improvements.” We at- 
tempted to implement this technique; however, due to the many 
conflicting objectives in our problem, most variables possessed frac- 
tional values as a result of the simplex starting solution. Adjusting 
this solution to zero-one form then tended to defeat the purpose of 
starting with a simplex-generated solution. 

The second method mentioned by Liggett¹ is constructive initial 
placement. Here a solution is built from scratch by adding variables 
to the solution set in successive steps that continually improve the 
results. Because zero-one form can be assumed from the start and 
each step improves the solution, this approach was more attractive for 
solving our assignment problem. Parker® interestingly compares var- 
ious approaches of both techniques and concludes that the constructive 
approach of Graves and Whinston‘ is superior to the iterative improve- 
ment method in the solution to the component placement problem, 
which is somewhat similar but smaller than our problem. Graves and 
Whinston use conditional probability in their algorithm as a general- 
purpose enumerative technique to select the next assignment to be 
added to the partially constructed solution. In lieu of conditional 
probability, we use a look-ahead feature before generating the model 
equations to accord high priority to certain assignments. We also 
restrict certain order-to-reel assignments that could exclude other 
assignments that appear preferable. Our specific problem is depicted 
by the model generated (see the appendix formulation). The algorithm, 
however, is of a deterministic general-purpose nature and is much less 
application-oriented. 


III. METHOD OF SOLUTION 


Our approach to the problem was to develop an algorithm that 
would produce desirable solutions proceeding under the direction of a 
model containing the problem formulation. The technique is based on 
a preemptive multipriority zero-one goal programming model formu- 
lation, which is used in a constructive initial placement algorithm. 

Goal programming is a multicriteria, problem-solving technique that 
utilizes mathematical equations and an associated priority scheme to 
govern its solution. Slack variables (see the $d^-$, $d^+$ terms in the 
equations of the appendix) associated with each equation are ranked and 
weighted in an achievement function to govern the achievement of the 
desired results. It is absolutely necessary that higher-ranked objectives 
are satisfied before lower-ranked objectives, and that the optimality 
of the overall solution is maintained with higher-ranked objectives as 
lower-ranked objectives are solved. 

The algorithm that we developed to solve this assignment problem 
does this and with much more flexibility than could be realized using 
the simplex algorithm. Our algorithm can maximize as well as 
minimize variables and, in fact, with this model formulation, makes 
considerable use of both. Since the algorithm is void of simplex 
restrictions, it also can perform nonlinear optimization. In this appli- 
cation, we minimize the reciprocal of certain slack variables, implying 
that if the slack variable cannot be driven to 0, then it should be 
maximized. 

Because we chose the constructive initial placement method, it is 
imperative that at each iteration, the real variable be chosen that will 
improve the solution most. To provide this, our algorithm uses the 
multiphase method of goal programming with entering variable tie 
logic. With tie logic, if several real variables improve the solution 
equally at a given priority level, then the improvement of each at the 
next lower priority level is used to decide which variable to bring into 
solution. Without tie logic in the algorithm, constructive initial place- 
ment would not provide satisfactory results. 
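
The tie logic amounts to a lexicographic comparison of each candidate's improvement across priority levels. The sketch below illustrates only that comparison; the function name, the variable labels, and the data layout (a tuple of improvements per candidate, highest priority first) are assumptions made for illustration and are not taken from the authors' program.

```python
def pick_entering_variable(improvements):
    """Return the candidate with the lexicographically largest improvement.

    improvements maps each candidate real variable to a tuple holding its
    improvement at every priority level, highest priority first. Python
    compares tuples element by element, so a tie at one level is broken
    by the improvement at the next lower level.
    """
    return max(improvements, key=lambda name: improvements[name])


# Hypothetical candidates: two tie at the top priority level.
candidates = {
    "x_11": (2.0, 0.0, 5.0),
    "x_12": (2.0, 1.0, 0.0),
    "x_21": (1.0, 9.0, 9.0),
}
chosen = pick_entering_variable(candidates)
```

Here `x_11` and `x_12` improve the top-priority objective equally, so the second level decides in favor of `x_12`; the large lower-level gains of `x_21` never come into play because it is worse at the top level. In practice, improvements that are equal only up to rounding would need a tolerance before being treated as ties.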


3.1 Contingencies 


Two conditions exist that involve assigning a set of related variables 
at the same time. Reel handling can be minimized, in part, by assigning 
several subitems to a single reel of cable when all or nearly all of the 
cable on that reel would be used. We adopted a straightforward 
technique described by Petersen® to solve this type of contingency. 
Let variables $X_{i1}$, $X_{i2}$, and $X_{i3}$ represent three different 
subitems that in combination would use nearly all the cable on reel i. A 
combination variable is generated to represent the assignment of all these 
variables on that reel. A mutual exclusivity constraint (described in the 
appendix, Section A.3, item a) ensures that the combination variable will 
not be allowed in the solution with $X_{i1}$, $X_{i2}$, and $X_{i3}$. If the 
combination is selected, then the combination variable equals 1 and 
$X_{i1}$, $X_{i2}$, and $X_{i3}$ = 0. If the combination is not selected, the 
combination variable equals 0 and $X_{i1}$, $X_{i2}$, and $X_{i3}$ = 0 or 1. 

The second condition, the subitem linkage contingency, is formu- 
lated in the model as an absolute set of constraint equalities. A set of 
constraints for three linked subitems would be: 


$$\sum_{p \in R} X_{p1} - \sum_{p \in R} X_{p2} = 0 \qquad (1)$$

$$\sum_{p \in R} X_{p2} - \sum_{p \in R} X_{p3} = 0, \qquad (2)$$


where $X_{p1}$, $X_{p2}$, $X_{p3}$ represent subitems 1, 2, and 3, respectively, 
assigned to any reel $p$, where $p$ is an element of the reels $R$ of plant $P$. 

These equality constraints must be satisfied. If any of the three 
subitems were assigned, then the other two subitems also must be 
assigned to a reel of cable at the same plant. Bringing an $X_{p1}$ into 
solution, for instance, would produce an infeasible condition unless a 
corresponding $X_{p2}$ and a corresponding $X_{p3}$ were also brought into 
solution. If this could not be done because of other constraints, then 
the assignment of subitems 1, 2, or 3 would not be allowed. 
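
A minimal sketch of how the linkage equalities (1) and (2) could be tested for one group of linked subitems is given below. The data layout (reel and subitem identifiers, a reel-to-plant map) and the function name are assumptions for illustration only.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def linkage_satisfied(assignments: Iterable[Tuple[str, int]],
                      reel_plant: Dict[str, str],
                      linked: Iterable[int]) -> bool:
    """Check the linkage equalities for one group of linked subitems.

    assignments: (reel, subitem) pairs currently in solution.
    reel_plant:  plant at which each reel is stocked.
    linked:      subitem numbers that must be assigned together.
    """
    linked = list(linked)
    counts = Counter()
    for reel, subitem in assignments:
        if subitem in linked:
            counts[(reel_plant[reel], subitem)] += 1
    plants = set(reel_plant.values())
    # At every plant, each linked subitem must be assigned the same number of
    # times (all 0 or all 1), which is what equalities (1) and (2) require.
    return all(len({counts[(plant, s)] for s in linked}) == 1 for plant in plants)


reel_plant = {"r1": "A", "r2": "A", "r3": "B"}
ok = linkage_satisfied([("r1", 1), ("r2", 2), ("r2", 3)], reel_plant, [1, 2, 3])
split = linkage_satisfied([("r1", 1), ("r3", 2), ("r2", 3)], reel_plant, [1, 2, 3])
```

In the first case all three subitems are assigned at plant A, so the equalities hold; in the second, subitem 2 is assigned at plant B, so the sums differ at both plants and the assignment is infeasible.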


3.2 The algorithm 


The algorithm incorporates two sets of real variables: (1) those in 
solution, designated the IS set; and (2) those in the NS set, which are 
out of solution but candidates for inclusion in the IS set. Using the 
constructive initial placement method, all variables are initially placed 
in the NS set and the initial value of the achievement function of the 
model is calculated. To determine which variable is brought into 
solution, the net effect on the achievement function is calculated 
individually for each member of the NS set and assigned to that 
member. This effect is determined by summing, for each member, the 
contribution of every slack variable of each objective that would be 
affected if the member were brought into solution. The NS set is then 
sorted by these net effects. Next, members are selected individually 
starting from the top of this set for “fit” within the model formulation. 
If the variable fits, it is removed from the NS set and placed in the IS set, 
a new achievement function value is calculated, and the NS set is 
reprioritized. If a member cannot be feasibly fit, it is dismissed from 
the NS set and the next member is tried. This process continues until 
the NS set is exhausted. The feasibility of the problem is then verified 
by testing the equality constraints. If any equality condition is not 
met, all real variables with a nonzero coefficient in the violated equality 
constraint are removed from both the IS set and the NS set to rectify 
the infeasible constraint violation. The NS set is reconstructed by 
placing into it all variables that are not in solution or that have not 
been eliminated because of this or any previously rectified equality 
constraint violation. The new achievement function value is then 
calculated and all variables in the NS set are reprioritized. The entire 
procedure from variable fitting through ensuring equality constraint 
feasibility is repeated until no equality violation is found. 
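
The fitting pass described above can be sketched as follows. This is a simplified illustration, not the authors' implementation: it omits the equality-constraint repair step, and the `net_effect` and `fits` callables, the toy reel data, and all names are hypothetical stand-ins for the achievement-function calculation and the model's feasibility test.

```python
from typing import Callable, Hashable, List, Set


def constructive_placement(
    candidates: Set[Hashable],
    net_effect: Callable[[Hashable, List[Hashable]], float],
    fits: Callable[[Hashable, List[Hashable]], bool],
) -> List[Hashable]:
    """One fitting pass: move variables from the NS set into the IS set."""
    ns = set(candidates)               # out of solution, still candidates
    in_solution: List[Hashable] = []   # the IS set, in order of entry
    while ns:
        # Reprioritize: rank remaining candidates by their current net effect.
        ranked = sorted(ns, key=lambda v: net_effect(v, in_solution), reverse=True)
        for v in ranked:
            ns.discard(v)              # a tried member leaves NS either way
            if fits(v, in_solution):
                in_solution.append(v)
                break                  # re-rank before selecting the next member
    return in_solution


# Toy data (hypothetical): reel lengths and subitem lengths in feet.
capacity = {"r1": 1000, "r2": 600}
length = {"s1": 900, "s2": 500, "s3": 400}


def net_effect(v, in_solution):
    _, subitem = v
    return length[subitem]             # stand-in: longer subitems count for more


def fits(v, in_solution):
    reel, subitem = v
    if any(s == subitem for _, s in in_solution):
        return False                   # each subitem is assigned at most once
    used = sum(length[s] for r, s in in_solution if r == reel)
    return used + length[subitem] <= capacity[reel]


candidates = {(r, s) for r in capacity for s in length if length[s] <= capacity[r]}
solution = constructive_placement(candidates, net_effect, fits)
```

With this toy data the pass places subitem `s1` on reel `r1` and then `s2` on reel `r2`; `s3` no longer fits on either reel and is dismissed.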


3.3 The model 


The model for this assignment application is formulated from input 
of a stock cable inventory and customer orders. For each product type 
we consider a separate problem. There are 14 possible objective types 
and 12 preemptive priorities. (See the appendix for the mathematical 
formulation.) 

1. Subitem contingency (Priority = absolute)—To guarantee 
timely shipments, all linked subitems must be assigned at the same 
time at the same stocking location. 

2. Reel capacity (Priority = absolute)—The reel capacity objective 
type ensures that the total of subitem lengths assigned to a reel plus 
all cutting allowances cannot exceed the length of cable on that reel. 

3. Combinations (Priority = 5)—Since a reel of cable is eliminated 
from inventory and reel handling is greatly minimized, this objective 
type, which encourages a good fit of multiple subitems to a single reel 
of cable, is accorded high priority. To further minimize reel handling, 
the model encourages combinations involving the largest number of 
subitems rather than smaller combinations. 

4. Perfect one-to-one matches (Priority = 4)—This objective type 
encourages perfect and close (up to 20 feet short of cable length) 
individual subitem length to cable length matches. The benefits are 
those of eliminating reels of cable and of minimizing reel handling. 

5. Assignment of future subitems (Priority = absolute)—A late- 
ness factor is calculated for every subitem. It is based on when the 
customer wants the cable and takes into account the amount of time 
required to process and ship the subitem from the time it is received. 
Subitems that need to be assigned in this week’s processing cycle in 
order to arrive at the job site on time are considered current subitems. 
Future subitems need not be assigned yet. Hence, future subitems are 
not assigned unless their assignment eliminates short remnant pieces 
that had been generated by the assignment of current and late subitems. 
Subitem linkage rules also apply to future subitems. 

6. Assignment of current and late subitems (Priority = 6)—To 
guarantee that customers are satisfied, the assignment of current and 
late subitems is encouraged by this objective type. These objectives 
are weighted by the lateness factor of the associated subitem, so that a 
subitem receives more consideration the later it becomes. They are 
also weighted by order length so that the large subitems can be placed 
on the large reels first, while the large reels of cable are still available. 

7. Forced subitems (Priority = 6)—In emergency situations, sub- 
items can be entered with a force code indicating that they should be 
assigned before all nonforced subitems. Even though the preemptive 
priority is the same as for current and late subitems, the weighting 
factor on this objective type is larger than that for any nonforced 
subitem, so forced subitems receive greater consideration. 

8. Use of full reels (Priority = 7)—This objective type encourages 
using all or almost all the cable on a reel when customer orders are 
assigned. 

9. Minimization of scrap (Priority = 8)—This objective type min- 
imizes assignments that cause cable to be scrapped. If cable must be 
scrapped, it also minimizes the amount scrapped. It is the only nonlin- 
ear objective type in the formulation. 

10. Remnants of just over the scrapping limit (Priority = 9)—Since 
few subitems are desired for an amount less than 50 feet over the 
scrapping limit, this objective type is formulated to prevent assign- 
ments from leaving less than this amount on any reel, thereby keeping 
carrying costs to a minimum. 

11. Remnants of less than 500 feet (Priority = 10)—Since approx- 
imately 80 percent of the subitems are for 500 feet or more, assign- 
ments that leave remaining cable lengths less than 500 feet are not 
desirable. This objective type provides this. 

12. Use of minimum number of reels and largest reel (Priority = 
11, 12)—This objective type encourages using as few reels as possible 
(Priority 11). The use of the largest reel is also encouraged (Priority 
12). 

13. Vacation (Priority = 1, 2)—This objective type accommodates 
shutdowns of stocking locations in a multilocation stocking environment. 
To avoid adversely affecting customer service due to a scheduled 
shutdown at a particular location, subitems will be assigned against 
inventories at locations that are not shut down because of vacation or 
other causes. 

14. Transportation (Priority = 3)—Transportation charges differ 
depending on the distance from the stocking location to the customer 
job site. Overall transportation charges are minimized by this objective 
type. 


IV. PROCESSING IMPROVEMENTS 


The algorithm has undergone considerable enhancement since the 
initial form was coded. Because it began producing good results using 
the constructive initial placement technique alone, the need to perform 
subsequent time-consuming iterative improvements was eliminated. 

After we obtained good results with the algorithm, we examined 
computer core size and run time extensively. The amount 
of core required was quickly reduced by trimming what was unnecessary 
in Lee’s goal programming simplex tableau.® Because the simplex 
tableau is not executed, the sparse simplex tableau was replaced by 
dense lists. This allowed us to handle a much larger number of 
variables and objectives. 

Reprioritizing the members of the NS set requires the largest 
segment of run time. This area was improved with several enhancements. 
When reprioritizing each member of the NS set, we found that 
many members could not be feasibly fit, given the set of variables 
already fit. Since constructive initial placement is a “block building” 
approach (once a variable is fit, it will never leave the solution at any 
time before the feasibility of the equality constraints is tested), we 
enhanced the procedure by removing these infeasible members from the 
NS set during reprioritizing. This considerably cut the size of the NS set. For 
instance, after a subitem is assigned, all associations of that subitem 
with other reels can be dropped from the NS set. Another large 
reduction was realized when we decided to recalculate the effect on 
the achievement function only for members of the NS set that are 
affected by the real variable just fit. 
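
The two reductions just described can be sketched as a single pruning step. This is an illustration under stated assumptions: candidates are modeled as hypothetical (reel, subitem) pairs, and "affected by the real variable just fit" is assumed here to mean "on the same reel"; the actual test used in the original code is not given in the text.

```python
from typing import Set, Tuple

Candidate = Tuple[str, str]  # (reel, subitem); a hypothetical representation


def prune_after_fit(ns: Set[Candidate],
                    fitted: Candidate) -> Tuple[Set[Candidate], Set[Candidate]]:
    """Return (remaining NS set, members whose net effect must be recomputed).

    After `fitted` enters the solution, every other pairing of the same subitem
    is dropped from NS. Only candidates on the same reel are flagged for
    recalculation (an assumed notion of "affected").
    """
    reel, subitem = fitted
    remaining = {c for c in ns if c != fitted and c[1] != subitem}
    to_recompute = {c for c in remaining if c[0] == reel}
    return remaining, to_recompute


ns = {("r1", "s1"), ("r2", "s1"), ("r1", "s2"), ("r2", "s2")}
remaining, to_recompute = prune_after_fit(ns, ("r1", "s1"))
```

After `s1` is fit on `r1`, the pairing of `s1` with `r2` disappears from the NS set, and only the remaining candidate on `r1` needs its effect recalculated.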

A further reduction in the execution time was realized in the 
handling of future subitems. Since a future subitem is only fit to 
prevent a remnant cable length less than 500 feet from being generated, 
the futures only need to be considered after all the current and late 
subitems have been considered. Also, only future subitems less than 
500 feet need be considered since those over 500 feet will not improve 
the solution. 

Central processing unit (CPU) time for solving all 26 problems has 
varied with the weekly mix, from 30 seconds to 30 minutes on an 
IBM 3033. 


V. OPTIMALITY CONSIDERATIONS 


This algorithm is constructive initial placement in the strict sense. 
Each iteration is performed without knowledge of its effect on other 
iterations. Also, once a variable is brought into solution, it will never 
be removed unless equality conditions are not met. Even though each 
iteration improves the result, the optimum may not, of course, be 
obtainable without a subsequent exchange of in-solution with 
out-of-solution real variables. A method of variable exchanging was not 
implemented because, with our subitem linkage restrictions, it would 
call for exchanges of the magnitude of 26/15 (26 variables in-solution 
exchanged for 15 variables out-of-solution) or more. An exchange 
algorithm would require a tremendous amount of time just to decide 
that a 26/15 exchange was necessary. Fortunately, due to our methods 
of eliminating certain assignments and identifying combinations of 
good fit prior to model generation, suboptimal solutions have been 
found empirically in only 2 percent of the solutions. 


VI. EASE OF USE 


This method of solution comprises the model and the algorithm. 
The model provides direction for the algorithm in its attempt to find 
an optimal solution. 

If new objectives are added or existing ones are changed, the model 
can easily be modified and the algorithm need not be changed. The 
algorithm can operate on any objective with zero-one real variables in 
which either maximization or minimization of the slack or real variables 
is desired. The result of an exponentiation operation on variables 
can also be maximized or minimized. Variable interaction (i.e., 
$d_1 d_2 = 10$), however, is not provided for. 

Since its initial implementation in August 1981, the model has 
undergone one revision. It was changed to handle two types of sub- 
items: (1) the subitems as previously described, and (2) subitems with 
a shipping policy tolerance and cutting allowance different from those 
previously described. Some objective types in the model required 
modification to provide for this but the algorithm has not required 
modification to accommodate model changes. 

It is also important that the algorithm was coded in such a way that 
our use of multiple preemptive priorities has only a minimal effect on 
computer processing time. 


VII. CONCLUSION 


Assignment problems of large size are difficult to solve. Hence, a 
great deal of attention among operations researchers has been devoted 
to the subject. To our knowledge, as yet no usable technique guarantees 
optimal solutions to large problems. Large assignment problems with 
the additional complexities of this particular assignment problem are 
even more difficult to solve. The nonlinear zero-one combinatorial 
goal programming model with constructive initial placement algorithm 
that we developed solves large and complex assignment problems 
satisfactorily. It is efficient, easy to modify, and produces good results. 


APPENDIX 


The Mathematical Formulation (Initial Implementation Formulation, 
August 1981) 

Let: 
$R_i$ = Reel $i$ of cable large enough to hold subitem $j$ 
$S_j$ = Subitem $j$ that can be taken from reel $i$ of cable 
$X_{ij}$ = 1 if subitem $j$ is assigned to reel $i$, 0 otherwise 
CA = 2 (Cutting Allowance) 
$\mathrm{SA}_j$ = Amount ordered on subitem $j$ 
$\mathrm{RC}_i$ = Amount of cable on reel $i$ 
$\mathrm{LATENESS}_j$ = Degree of lateness of subitem $j$ 
SP = Shipping policy (assignment) tolerance 
SL = Maximum amount allowed to scrap 

$d^-, d^+ \ge 0$ 

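A.1 Subitem contingency 

$$\sum_{\substack{\text{all } R_i \text{ on PLANT}_P \\ \text{of LINKUP}_k}} X_{ij} = \sum_{\substack{\text{all reels } i \text{ on PLANT}_P \\ \text{large enough for} \\ \text{SUBITEM}_{j+1} \text{ of LINKUP}_k}} X_{i,j+1} \quad \forall\, \text{LINKUP}_k,\ \text{SUBITEM}_j \text{ of LINKUP}_k,\ \text{PLANT}_P$$

where $1 \le j <$ number of subitems in $\text{LINKUP}_k$, 
where $\text{LINKUP}_k$ = subitem contingency of subitems involving 
$\text{SUBITEM}_j$. 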


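A.2 Reel capacity 

$$\sum_{\text{all } S_j} (\mathrm{SA}_j + \mathrm{CA})\,X_{ij} + d^-_{2i} = \mathrm{RC}_i + \mathrm{CA} \quad \forall\, \text{REEL}_i$$


A.3 Combinations 

a. $$\sum_{\substack{\text{all subitems } j \\ \text{in good match} \\ \text{on REEL}_i}} X_{ij} + \mathrm{NOSUBFIT}_i\,X_{ic} + d^-_{3ai} = \mathrm{NOSUBFIT}_i \quad \forall\, \text{REEL}_i \text{ where a MATCH}_i \text{ exists}$$

b. $$X_{ic} + d^-_{3bi} = 1 \quad \forall\, \text{MATCH}_i$$

$\mathrm{WEIGHT} = 500 - \mathrm{FITDIFF}_i + (\mathrm{NOSUBFIT}_i \cdot 1000)$, 

where $\mathrm{FITDIFF}_i$ = the footage short of an absolutely perfect match, 
$\mathrm{NOSUBFIT}_i$ = the number of subitems in the fit on $\text{REEL}_i$, and 
$X_{ic}$ is a variable generated to represent $\text{MATCH}_i$. 

Minimize at Priority 5: $(d^-_{3bi} \cdot \mathrm{WEIGHT})\ \forall\, i$. 
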
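A.4 Perfect one-to-one matches 

$$X_{ij} + d^-_{4ij} = 1 \quad \forall\, \text{REEL}_i,\ \text{SUBITEM}_j \text{ where a one-to-one match exists}$$

$\mathrm{WEIGHT} = (10{,}000{,}000 - (\mathrm{SA}_j \cdot 1000)) + (1000 - \mathrm{REELRANK}_i)$, 

where $\mathrm{REELRANK}_i$ = ranking of $\text{REEL}_i$ relative to other reels by 
amount of cable on reel. 

Minimize at Priority 4: $(d^-_{4ij} \cdot \mathrm{WEIGHT})\ \forall\, i, j$. 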


A.5 Assignment of future subitems 

$$\sum_{\text{all future } S_j} X_{ij} - \infty \sum_{\substack{\text{all curr.} \\ \text{and late } S_j}} X_{ij} + d^-_{5i} = 0 \quad \forall\, \text{REEL}_i$$


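A.6 Assignment of current and late subitems 

$$\sum_{\text{all } R_i} X_{ij} + d^-_{6j} = 1 \quad \forall\, \text{SUBITEM}_j$$

$\mathrm{WEIGHT} = [(\mathrm{LATENESS}_j + 1) \cdot 100{,}000] + \mathrm{SA}_j$, if $\text{SUBITEM}_j$ is not 
future; 0, if $\text{SUBITEM}_j$ is future. 

Minimize at Priority 6: $(d^-_{6j} \cdot \mathrm{WEIGHT})\ \forall\, j$. 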
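
As a worked illustration of the weight used for current and late subitems, the Python sketch below evaluates WEIGHT = [(LATENESS + 1) × 100,000] + SA for a few cases. The function and argument names are hypothetical; only the formula itself comes from the formulation above.

```python
def current_late_weight(lateness: int, amount_ordered: int, is_future: bool) -> int:
    """WEIGHT for the current-and-late objective.

    (LATENESS + 1) * 100,000 + SA for current or late subitems, 0 for futures.
    """
    if is_future:
        return 0
    return (lateness + 1) * 100_000 + amount_ordered


on_time_short = current_late_weight(lateness=0, amount_ordered=500, is_future=False)
late_short = current_late_weight(lateness=2, amount_ordered=750, is_future=False)
future_item = current_late_weight(lateness=0, amount_ordered=5000, is_future=True)
```

Because the lateness term is scaled by 100,000, lateness dominates the weight and the ordered length acts as a secondary term for any order shorter than 100,000 feet.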


A.7 Forced subitems 

$$\sum_{\text{all forced subitems } j}\ \sum_{\text{all } R_i} X_{ij} + d^-_{7} = \infty$$

WEIGHT = 1,000,000 

Minimize at Priority 6: $(d^-_{7} \cdot \mathrm{WEIGHT})$. 

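A.8 Use of full reels 

$$\sum_{\text{all } S_j} (\mathrm{SA}_j + \mathrm{CA} + \mathrm{SP})\,X_{ij} + d^-_{8i} - d^+_{8i} = \mathrm{RC}_i + \mathrm{CA} - 51 \quad \forall\, \text{REEL}_i$$

Maximize at Priority 7: $d^+_{8i}\ \forall\, i$. 
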
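A.9 Minimization of scrap 

$$\sum_{\text{all } S_j} (\mathrm{SA}_j + \mathrm{CA} + \mathrm{SP})\,X_{ij} + d^-_{9i} - d^+_{9i} = \mathrm{RC}_i - \mathrm{SL} + \mathrm{CA} \quad \forall\, \text{REEL}_i$$

Minimize at Priority 8: $(1/d^+_{9i})\ \forall\, i$. 
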
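A.10 Remnants of just over the scrapping limit 

$$\sum_{\text{all } S_j} (\mathrm{SA}_j + \mathrm{CA})\,X_{ij} + d^-_{10i} - d^+_{10i} = \mathrm{RC}_i - \mathrm{SL} - 50 \quad \forall\, \text{REEL}_i$$

Minimize at Priority 9: $d^+_{10i}\ \forall\, i$. 
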
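A.11 Remnants of less than 500 feet 

$$\sum_{\text{all } S_j} (\mathrm{SA}_j + \mathrm{CA})\,X_{ij} + d^-_{11i} - d^+_{11i} = \mathrm{RC}_i - 500 \quad \forall\, \text{REEL}_i$$

Minimize at Priority 10: $d^+_{11i}\ \forall\, i$. 
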
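A.12 Use of minimum number of reels and largest reel 

$$\sum_{\text{all } S_j} X_{ij} + d^-_{12i} - d^+_{12i} = 1 \quad \forall\, \text{REEL}_i$$

Maximize at Priority 11: $d^-_{12i}\ \forall\, i$. 
Minimize at Priority 12: $(\mathrm{RC}_i\, d^-_{12i})\ \forall\, i$. 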


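A.13 Vacation 

a. $$\sum_{\substack{\text{all current} \\ \text{and late } S_j}}\ \sum_{\substack{\text{all reels } i \text{ of plants} \\ \text{not on vacation} \\ \text{during preparation} \\ \text{of SUBITEM}_j}} X_{ij} + d^-_{13a} = 0$$

Minimize at Priority 1: $d^-_{13a}$. 

b. $$\sum_{\substack{\text{all current} \\ \text{and late } S_j}}\ \sum_{\substack{\text{all reels } i \text{ of plants} \\ \text{on vacation one week} \\ \text{during preparation} \\ \text{of SUBITEM}_j}} X_{ij} + d^-_{13b} = 0$$

Minimize at Priority 2: $d^-_{13b}$. 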


A.14 Transportation 

$$\sum_{\text{all subitems } j}\ \sum_{\text{all } R_i} \left\{ \frac{1{,}000{,}000\,(\mathrm{LATENESS}_j + 1)}{\mathrm{TRANSPRT}_{ij}\,\mathrm{SA}_j} + 1 \right\} X_{ij} + d^-_{14} = \infty,$$

where $\mathrm{TRANSPRT}_{ij}$ = transportation rate from stocking location of 
$\text{REEL}_i$ to job site of $\text{SUBITEM}_j$. 

Minimize at Priority 3: $d^-_{14}$. 


Note: For generated variables representing a combination of good 
match, $\mathrm{LATENESS}_j$ is the largest degree of lateness of all orders 
represented by the generated variable. 


AUTHORS 


Julian H. Fletcher, B.S. (Industrial Engineering), 1970, Georgia Institute of 
Technology; AT&T Technologies, Inc., 1970—. Mr. Fletcher’s assignments 
include design and implementation of aggregate plant loading and inventory 
management systems for products manufactured within the Cable and Wire 
Division of AT&T Technologies, Inc. He is in the Information Systems 
Development Organization at the Atlanta Works. Member, The Institute of 
Management Sciences (TIMS). 


Kenneth R. Lipske, B.S. (Mathematics), 1970, University of Wisconsin, 
Eau Claire; AT&T Technologies, Inc., 1970—. Mr. Lipske’s assignments entail 
optimizing the aggregate plant loading and inventory management functions 
for the various product lines manufactured by the Cable and Wire Division of 
AT&T Technologies, Inc. He is in the Information Systems Development 
Organization at the Atlanta Works. Member, The Institute of Management 
Sciences (TIMS). 


SOLVING MULTIOBJECTIVE ASSIGNMENT PROBLEMS 677 


PAPERS BY AT&T BELL LABORATORIES AUTHORS 


COMPUTING/MATHEMATICS 


Adleman L. M., Odlyzko A. M., Irreducibility Testing and Factorization of Poly- 
nomials. Math Comput 41(164):699-709, 1983. 

Allen R. B., Scerbo M. W., Details of Command-Language Keystrokes. ACM T 
Offic 1(2):159-178, 1983. 

Barmish B. R., Petersen I. R., Feuer A., Linear Ultimate Boundedness Control of 
Uncertain Dynamical Systems. Automatica 19(5):523-532, 1983. 

Barron E. N., Jensen R., A Nonlinear Evolution System With Two Subdifferen- 
tials and Monotone Differential Games. J Math Anal 97(1):65-80, 1983. 

Bentley J., Cracking the Oyster (Editorial). Comm ACM 26(8):550-552, 1983. 

Borkovitz D., Hwang F. K., Multiplicative Magic Squares. Discr Math 47(1):1-11, 
1983. 

Brown M. K., Ganapathy S., Preprocessing Techniques for Cursive Script Word 
Recognition. Patt Recog 16(5):447-458, 1983. 

Chung F. R. K., Unavoidable Stars in Three Graphs. J Comb Th A 35(3):252- 

Conway J. H., Sloane N. J. A., A Fast Encoding Method for Lattice Codes and 
Quantizers. IEEE Info T 29(6):820-824, 1983. 

Erdos P., Mullin R. C., Sos V. T., Stinson D. R., Finite Linear Spaces and Projective 
Planes. Discr Math 47(1):49-62, 1983. 

Fishburn P. C., Interval Lengths for Interval Orders—A Minimization Prob- 
lem. Discr Math 47(1):63-82, 1983. 

Garey M. R., Combinatorial Optimization—Algorithms and Complexity—Pa- 
padimitriou C. H., Steiglitz K. (Book Review). Am Scient 71(6):658, 1983. 
Garey M. R., Johnson D. S., Crossing Number is NP-Complete. Siam J Alg 
4(3):312-316, 1983. 

Gershenfeld N. A., Schadler E. H., Bilaniuk O. M., APL and the Numerical Solution 
of High-Order Linear-Differential Equations. Am J Phys 51(8):743-746, 1983. 
Johnson D. S., Klug A., Optimizing Conjunctive Queries That Contain Untyped 
Variables. SIAM J Comp 12(4):616-640, 1983. 

Larson R. G., Elongational-Flow Predictions of the Leonov Constitutive Equa- 
tion. Rheol Act 22(5):435-448, 1983. 

Levy L. S., A Walk Through AWK. Sigplan Not 18(12):69-85, 1983. 

Lu P. M., Yau S. S., Hong W., A Formal Methodology Using Attributed Grammars 
for Multiprocessing-System Software Development. 1. Design Representation. 
Inf Sci 30(2):79-105, 1983. 

Lu P. M., Yau S. S., Hong W., A Formal Methodology Using Attributed Grammars 
for Multiprocessing-System Software-Development. 2. Validation. Inf Sci 
30(2):107-123, 1983. 

Luss H., An Extended Model for the Optimal Sizing of Records. J Oper Res 
34(11):1099-1105, 1983. 

Mitra D., Morrison J. A., Asymptotic Expansions of Moments of the Waiting 
Time in a Shared-Processor of an Interactive System. Siam J Comp 12(4):789- 
802, 1983. 

Petersen W. P., Vector FORTRAN for Numerical Problems on CRAY 1. Comm 
ACM 26(11):1008-1021, 1983. 

Vo K. P., Whitney R., Tableaux and Matrix Correspondences. J Comb Th A 
35(3):328-359, 1983. 


ENGINEERING 


Agrawal G. P., Dutta N. K., Effect of Auger Recombination on the Threshold 
Characteristics of Gain-Guided InGaAsP Lasers. Electr Lett 19(23):974-976, 
1983. 


Alferness R. C., Joyner C. H., Buhl L. L., Korotky S. K., High-Speed Traveling- 
Wave Directional Coupler Switch Modulator for Lambda = 1.32 μm. IEEE J 
Q El 19(9):1339-1341, 1983. 

Bhatt R. N., Lee P. A., Low-Temperature Conductivity of Doped Semiconduc- 
tors—Mass Anisotropy and Intervalley Effects. Sol St Comm 48(9):755-759, 
1983. 

Chraplyvy A. R., Kaminow I. P., Dentai A. G., Evaluation of InGaAsP Laser 
Material by Optical Pumping. IEEE J Q El 19(9):1342-1344, 1983. 

Citrin P. H., Rowe J. E., Local-Structure of Adsorbates on Semiconductor Sur- 
faces Using SEXAFS—A Brief Summary. Surf Sci 132(1-3):205-211, 1983. 

Dupuis R. D., Hartman R. L., Nash F. R., Facet-Coated Graded-Index Separate- 
Confinement-Heterostructure Single-Quantum-Well Lasers Having Low Deg- 
radation Rates (L1 Percent/kh) at 70 Degrees C. IEEE Elec D 4(8):286-288, 
1983. 

Engelmaier W., Fatigue Life of Leadless Chip Carrier Solder Joints During 
Power Cycling. IEEE Compon 6(3):232-237, 1983. 

Gans M. J., Amitay N., Narrow Multibeam Satellite Ground Station Antenna 
Employing a Linear Array With a Geosynchronous Arc Coverage of 60 De- 
grees. 2. Antenna Design. IEEE Antenn 31(6):966-972, 1983. 

Haskell B. G., High Definition Television (HDTV)—Compatibility and Distri- 
bution (Letter). IEEE Commun 31(12):1308-1317, 1983. 

Heimann P. A., An Operational Definition for Breakdown of Thin Thermal 
Oxides of Silicon. IEEE Device 30(10):1366-1368, 1983. 


Henry C. H., Theory of the Phase Noise and Power Spectrum of a Single-Mode 
Injection Laser. IEEE J Q El 19(9):1391-1397, 1983. 

Hsu F. C., Muller R. S., Hu C. M., Ko P. K., A Simple Punchthrough Model for 
Short-Channel MOSFETs. IEEE Device 30(10):1354-1359, 1983. 


Johnson A. M., Simpson W. M., Continuous-Wave Mode-Locked Nd:YAG- 
Pumped Subpicosecond Dye Lasers. Optics Lett 8(11):554-556, 1983. 

Kaminow I. P., Ko J. S., Linke R. A., Stulz L. W., High-Speed 1.55 μm Single- 
Longitudinal-Mode Ridge Wave-Guide C3 Laser. Electr Lett 19(19):784-785, 
1983. 


Kavehrad M., Adaptive Decision Feedback Cancellation of Intersymbol Inter- 
ference Over Multipath Fading Radio Channels. ICC 2: C8.5.1-C8.5.3, 1983. 


Knausenberger W. H., Teneketges N. A., High Pinout-IC Packaging and the 
Density Advantage of Surface Mounting. IEEE Compon 6(3):298-304, 1983. 
Kuo W., Kuo Y., Facing the Headaches of Early Failures—A State-of-the-Art 
Review of Burn-In Decisions (Review). P IEEE 71(11):1257-1266, 1983. 

Lamorte M. F., Abbott D. H., Influence of Band-Gap Span on Cascade Solar-Cell 
Efficiency. Solar Cells 10(1):33-48, 1983. 

Leheny R. F., Liao A., Tell B., Mayer L., Integrated Photoreceivers for Long 
Wavelength Communications Systems—Is the Technology Ready? P Soc 
Photo 408:115-120, 1983. 

Linke R. A., Kasper B. L., Ko J. S., Kaminow I. P., Vodhanel R. S., 1 Gbit/s 
Transmission Experiment Over 101 km of Single-Mode Fiber Using a 1.55 
μm Ridge Guide C3 Laser. Electr Lett 19:775-776, 1983. 

Liu P. L., Fencil L. E., Ko J. S., Kaminow I. P., Lee T. P., Burrus C. A., Amplitude 
Fluctuations and Photon Statistics of InGaAsP Injection Lasers. IEEE J Q El 
19(9):1348-1351, 1983. 

Manheimer M. A., Bhagat S. M., Chen H. S., Reentrant Magnetism—A Low Field 
Study. J Magn Magn 38(2):147-158, 1983. 

Marcuse D., Lee T. P., On Approximate Analytical Solutions of Rate-Equations 
for Studying Transient Spectra of Injection Lasers. IEEE J Q El 19(9):1397- 
1406, 1983. 


Netravali A. N., Pirsch P., Character Display on a CRT. IEEE Broadc 29(3):106- 
110, 1983. 


Ross I. M., Technology for Tomorrow (Editorial). IEEE Comm M 21(9):42-45, 
1983. 


Schaper L. W., Amey D. I., Improved Electrical Performance Required for Future 
MOS Packaging. IEEE Compon 6(3):283-289, 1983. 


680 TECHNICAL JOURNAL, APRIL 1984 


Schwab T. F., Yau S. S., An Algebraic Model of Fault-Masking Logic Cir- 
cuits. IEEE Comput 32(9):809-825, 1983. 

Sproles E. S., Sharma S. P., Effect of HCl and Cl2 on Pd-Inlay Coupons and Pd- 
Connector Contacts. IEEE Compon 6(3):343-348, 1983. 

Stolen R. H., Applications of Nonlinear Phenomena in Optical Fibers. P Soc 
Photo 408:156-162, 1983. 

Tsang W. T., Olsson N. A., Logan R. A., Ditzenberger J. A., Kaminow I. P., Ko J. S., 
Single-Longitudinal Mode Performance—Characteristics of Cleaved-Cou- 
pled-Cavity Lasers. Appl Phys L 43(11):1003-1005, 1983. 

Worlock J. M., Wave Vector Dependent Magneto-Optical Interband Transitions 
in Semiconductors. Sol St Comm 48(12):1067-1070, 1983. 


MANAGEMENT/ECONOMICS 


Fishburn P. C., Transitive Measurable Utility. J Econ Theo 31(2):293-317, 1983. 
Gilstein C. Z., Leamer E. E., Robust Sets of Regression Estimates. Econometric 
51(2):321-333, 1983. 

Gordon R. H., An Optimal Taxation Approach to Fiscal Federalism. Q J Econ 
98(4):567-586, 1983. 

Norman A. L., Lasdon L. S., Hsin J. K., A Comparison of Methods for Solving and 
Optimizing a Large Nonlinear Econometric Model. J Econ Dyn 6(1-2):3-24, 
1983. 


Stiglitz J. E., Weiss A., Incentive Effects of Terminations—Applications to the 
Credit and Labor Markets. Am Econ Rev 73(5):912-927, 1983. 


PHYSICAL SCIENCES 


Abrahams S. C., Liminga R., Marsh P., Schrey F., Albertsson J., Svensson C., Kvick 
A., Thermal Expansivity of α-LiIO3 Between 20 and 520K. J Appl Crys 
16(OCT):453-457, 1983. 

Aeppli G. et al., Spin Correlations and Reentrant Spin-Glass Behavior in Amor- 
phous Fe-Mn Alloys—Statics. Phys Rev B 28(9):5160-5172, 1983. 


Aharonshalom E., Heller A., Injection Electro-Luminescence of N-InP in Dilute 
Nitric Acid. J Phys Chem 87(24):4913-4918, 1983. 


Aspnes D. E., Heller A., Photoelectrochemical Hydrogen Evolution and Water- 
Photolyzing Semiconductor Suspensions—Properties of Platinum Group 
Metal Catalyst Semiconductor Contacts in Air and in Hydrogen. J Phys Chem 
87(24):4919-4929, 1983. 

Bacon D. D., English A. T., Nakahara S., Peters F. G., Schreiber H., Sinclair W. R., 
Vandover R. B., Properties of NbN Thin Films Deposited on Ambient-Temper- 
ature Substrates. J Appl Phys 54(11):6509-6516, 1983. 

Bondybey V. E., English J. H., Electronic Structure and Spectra of SnBi. J Chem 
Phys 79(10):4746-4748, 1983. 

Bondybey V. E., English J. H., Observation and Characterization of a New 
Electronic State of Cu-2 in Solid Neon. J Phys Chem 87(23):4647-4650, 1983. 

Bowmer T. N., Vroom W. I., Radiation Crosslinking of Poly(vinyl-Chloride) With 
Trimethylolpropanetrimethacrylate. 4. Effect of Diundecyl Phthalate—De- 
pendence of Physical Properties on Composition. J Appl Poly 28(11):3527-3548, 
1983. 

Brasen D., Bonner W. A., Effect of Temperature and Sulfur Doping on the Plastic 
Deformation of InP Single Crystals. Mater Sci E 61(2):167-172, 1983. 

Braun C. L., Scott T. W., Picosecond Measurements of Time-Resolved Geminate 
Charge Recombination (Letter). J Phys Chem 87(24):4776-4778, 1983. 
Broughton J. Q., Gilmer G. H., Molecular-Dynamics Investigation of the Crystal 
Fluid Interface. 1. Bulk Properties. J Chem Phys 79(10):5095-5104, 1983. 
Broughton J. Q., Gilmer G. H., Molecular-Dynamics Investigation of the Crystal 
Fluid Interface. 2. Structures of the FCC (111), (100), and (110) Crystal Vapor 
Systems. J Chem Phys 79(10):5105-5118, 1983. 


Broughton J. Q., Gilmer G. H., Molecular-Dynamics Investigation of the Crystal 
Fluid Interface. 3. Dynamical Properties of FCC Crystal Vapor Systems. J 
Chem Phys 79(10):5119-5127, 1983. 


Brus L. E., A Simple Model for the Ionization Potential, Electron Affinity, and 
Aqueous Redox Potentials of Small Semiconductor Crystallites. J Chem Phys 
79(11):5566-5571, 1983. 

Burton R. H., Hollien C. L., Marchut L., Abys S. M., Smolinsky G., Gottscho R. A., 
Etching of Gallium-Arsenide and Indium-Phosphide in RF Discharges 
Through Mixtures of Trichlorofluoromethane and Oxygen. J Appl Phys 
54(11):6663-6671, 1983. 

Capasso F., Alavi K., Cho A. Y., Foy P. W., Bethea C. G., New Long Wavelength 
Al0.48In0.52As/Ga0.47In0.53As Avalanche Photo-Diode Grown by Molecular- 
Beam Epitaxy. Appl Phys L 43(11):1040-1042, 1983. 

Capasso F., Luryi S., Tsang W. T., Bethea C. G., Levine B. F., New Transient 
Electrical-Polarization Phenomenon in Sawtooth Super Lattices. Phys Rev L 
51(25):2318-2321, 1983. 


Capasso F., Tsang W. T., Hutchinson A. L., Bethea C. G., Levine B. F., Novel Graded 
Band-Gap Photodetectors With Ultrahigh Speed of Response (t, = 20 ps, 
FWHM = 40 ps) and Phototransistors with Graded Gap Base. Inst Phys 
C1983(65):225-231, 1983. 

Cava R. J., Murphy D. W., Zahurak S. M., Lithium Insertion in Wadsley-Roth 
Phases Based on Niobium Oxide. J Elchem So 130(12):2345-2351, 1983. 

Cava R. J., Santoro A., Murphy D. W., Zahurak S. M., Roth R. S., The Structures of 
the Lithium Inserted Metal-Oxides Li0.2ReO3 and Li0.36WO3. J Sol St Ch 
50(1):121-128, 1983. 

Chance B., Kumar C., Powers L., Ching Y., Peroxidatic Form of Cytochrome- 
Oxidase as Studied by X-Ray Absorption Spectroscopy. Biophys J 44(3):353- 
363, 1983. 

Chen C. H., Fleming R. M., Strandlike Microstructures in the Charge-Density 
Wave States of Orthorhombic TaS3. Sol St Comm 48(9):777-779, 1983. 

Chen H. S., Kurkjian C. R., Sub-Sub-Tg Enthalpy Relaxation in a B2O3 Glass. J 
Am Ceram 66(9):613-619, 1983. 

Chu S. N. G., Jodlauk C. M., Johnston W. D., Morphological Study of Thermal 
Decomposition of InP Surfaces. J Elchem So 130(12):2398-2405, 1983. 

Cohen R. L., West K. W., Intrinsic Cycling Degradation in LaNi5 and Annealing 
Procedures for Reforming the Material. J Lessc Met 95(1):17-23, 1983. 

Coppersmith S. N., Scaling Behavior Near the Pinning Transition of the Discrete 
Frenkel-Kontorova Model. J Physique 44(NC-3):1579-1582, 1983. 

Digiuseppe M. A., Temkin H., Peticolas L., Bonner W. A., Quantum Well Structures 
of In0.53Ga0.47As/InP Grown by Hydride Vapor-Phase Epitaxy in a Multiple 
Chamber Reactor. Appl Phys L 43(10):906-908, 1983. 

Duncan T. M., Winslow P., Bell A. T., C-13 NMR Spectra of Carbonaceous 
Deposits on Silica-Supported Ruthenium Catalysts. Chem P Lett 102(2-3):163- 
167, 1983. 


Fleming R. M., Chen C. H., Moncton D. E., Electric-Field-Induced Structural 
Changes in NbSe3. J Physique 44(NC-3):1651-1658, 1983. 

Forrest S. R., Kim O. K., Smith R. G., Analysis of the Dark Current and Photo- 
response of In0.53Ga0.47As/InP Avalanche Photodiodes. Sol St Elec 26(10):951- 
968, 1983. 

Graedel T. E., Franey J. P., Kammlott G. W., The Corrosion of Copper by Atmos- 
pheric Sulfurous Gases. Corros Sci 23(11):1141+, 1983. 

Gross B., Von Seggern H., Berkley D. A., Long-Term Behavior of Radiation- 
Induced Currents in Fluorinated Ethylene Propylene Co-Polymer. Phys St 
S-A 79(2):607-615, 1983. 

Guntherodt G., Jayaraman A., Batlogg G., Croft M., Melczer E., Raman Scattering 
From Coupled Phonon and Electronic Crystal-Field Excitations in 
CeAl2. Phys Rev L 51(25):2330-2332, 1983. 

Habbal F., Bevk J., Interface Flux Pinning in Insitu Formed Superconducting 
Composites. J Appl Phys 54(11):6543-6548, 1983. 

Hayes J. R., Capasso F., Malik R. J., Gossard A. C., Wiegmann W., Optimum Emitter 
Grading for Heterojunction Bipolar Transistors. Appl Phys L 43(10):949-951, 
1983. 


682 TECHNICAL JOURNAL, APRIL 1984 


Heiney P. A. et al., X-Ray-Scattering Study of the Structure and Freezing 
Transition of Monolayer Xenon on Graphite. Phys Rev B 28(11):6416-6434, 
1983. 

Hemenway B. R., Bowers J. E., Miller B. I., Anisotropic Undercutting in (100) 
Indium-Phosphide. Electr Lett 19(24):1049-1051, 1983. 

Henein G. E., Wagner W. R., Stresses Induced in GaAs by TiPt Ohmic Con-
tacts. J Appl Phys 54(11):6395-6400, 1983.

Henkel C., Wilson T. L., Walmsley C. M., Pauls T., Formaldehyde Towards Com- 
pact H-II Regions—Densities and Isotope Ratios. Astron Astr 127(2):388-394, 
1983. 

Hensel J. C., Halperin B. I., Dynes R. C., Dynamical Model for the Absorption and
Scattering of Ballistic Phonons by the Electron Inversion Layer in Sili- 
con. Phys Rev L 51(25):2302-2305, 1983. 

Hilinski E. F., Milton S. V., Rentzepis P. M., Picosecond Spectroscopy and Appli- 
cations to Chemical and Biological Systems (Review). ACS Symp S 
1983(236):201-220, 1983. 

Hou T. W., Mogab C. J., Wagner R. S., Ion Milling Planarization for Magnetic-
Bubble Devices. J Vac Sci x 1(4):1801-1805, 1983.

Inoue A., Masumoto T., Hagiwara M., Chen H. S., The Structural Relaxation 
Behavior of Pd48Ni32P20, Fe75Si10B15, and Co72.5Si12.5B15 Amorphous Alloy Wire
and Ribbon. Scrip Metal 17(10):1205-1208, 1983. 

Jindal R. P., A Normalized Analytical Solution for the Capacitance Associated 
With Uniformly Doped Semiconductors at Equilibrium. Sol St Elec 
26(10):1005-1008, 1983. 

Johnson D. W., Rabinovich E. M., Macchesney J. B., Vogel E. M., Preparation of 
High-Silica Glasses from Colloidal Gels. 2. Sintering. J Am Ceram 66(10):688- 
693, 1983. 

Kastalsky A., Dingle R., Cheng K. Y., Cho A. Y., Two-Dimensional Electron-Gas 
at an MBE-Grown, Selectively-Doped, In0.53Ga0.47As/In0.52Al0.48As Inter-
face. Inst Phys C 1983(65):181-185, 1983.

Kevan S. D., Design of a High-Resolution Angle-Resolving Electron-Energy 
Analyzer. Rev Sci Ins 54(11):1441-1445, 1983. 

Kohl P. A., Wolowodiuk C., Ostermayer F. W., The Photoelectrochemical Oxidation 
of (100), (111), and (1̄1̄1̄) N-InP and N-GaAs. J Elchem So 130(11):2288-2293,
1983. 


Lester M. I., Zegarski B. R., Miller T. A., Laser-Induced Fluorescence Studies of 
Large and Small Molecular Cations Produced by Using Electron Bombardment 
in a Free Jet Expansion. J Phys Chem 87(25):5228-5233, 1983. 

Levinson M., Stavola M., Benton J. L., Kimerling L. C., Metastable M-Center in 
InP—Defect-Charge-State Controlled Structural Relaxation. Phys Rev B 
28(10):5848-5855, 1983. 

Lifshitz N., Solubility of Implanted Dopants in Polysilicon—Phosphorus and 
Arsenic. J Elchem So 130(12):2464-2467, 1983. 

Meiboom S., Sammon M., Berreman D. W., Lattice Symmetry of the Cholesteric 
Blue Phases. Phys Rev A 28(6):3553-3560, 1983. 

Miller R. C., Gossard A. C., Some Effects of a Longitudinal Electric Field on the 
Photoluminescence of Paradoped GaAs-AlxGa1-xAs Quantum Well Hetero-
structures. Appl Phys L 43(10):954-956, 1983.

Mitchell J. W. et al., Microscale Homogeneity and Compositional Profiling of 
Borosilicate Glass Materials. Mikroch Act 3(3-4):253-261, 1983.

Murphy D. W., Dye J. L., Zahurak S. M., Alkali-Metal Insertion in the Pyrochlore 
Structure (Letter). Inorg Chem 22(25):3679-3681, 1983. 

Nakahara S., Abys J. A., Abys S. M., Room-Temperature Diffusion-Induced Grain- 
Boundary Migration in the Fine-Grained Pd Side of Cu-Pd Diffusion Cou- 
ples. Mater Lett 2(2):155-159, 1983. 

Paalanen M. A., Tsui D. C., Hwang J. C. M., Parabolic Magnetoresistance From 
the Interaction Effect in a Two-Dimensional Electron Gas. Phys Rev L 
51(24):2226-2229, 1983. 

Pearson D. S. et al., Comparison of the Rheological Properties of Linear and 
Star-Branched Polyisoprenes in Shear and Elongational Flows. J Pol Sc PP 
21(11):2287-2298, 1983. 

Pfeiffer L., Celler G. K., Kovacs T., Robinson M., Suppression of Low-Angle Grain-
Boundaries in Seeded Thick Si Films Recrystallized Between SiO2 Lay-
ers. Appl Phys L 43(11):1048-1050, 1983.

Phillips J. M., Feldman L. C., Gibson J. M., McDonald M. L., Epitaxial Growth of 
Alkaline-Earth Fluorides on Semiconductors. Thin Sol Fi 107(3):217-226, 1983.

Pini R., Salimbeni R., Matera M., Lin C. L., Wideband Frequency Conversion in
the UV by Nine Orders of Stimulated Raman Scattering in a XeCl Laser 
Pumped Multimode Silica Fiber. Appl Phys L 43(6):517-518, 1983. 

Rabinovich E. M., Johnson D. W., Macchesney J. B., Vogel E. M., Preparation of 
High-Silica Glasses from Colloidal Gels. 1. Preparation for Sintering and 
Properties of Sintered Glasses. J Am Ceram 66(10):683-688, 1983. 

Roth H. D., Schilling M. L., Radical Cations of Tetraalkylcyclopropanes—The 
Cyclopropane Moiety as an Electron Donor. J Am Chem S 105(23):6805-6808, 
1983. 

Schleyer P. V. et al., The 1,2-Hydride Shift Barrier in the Phenyl Cation—An 
Abinitio Study. J Chem S CH 1983(22):1296-1298, 1983.

Seidel T. E., Rapid Thermal Annealing of BF2+ Implanted, Preamorphized
Silicon (Letter). IEEE Elec D 4(10):353-355, 1983.

Smith C. G., Okinaka Y., High-Speed Gold Plating—Anodic Bath Degradation
and Search for Stable Low Polarization Anodes. J Elchem So 130(11):2149- 
2157, 1983. 

Smith N. V., Inverse Photoemission and Related Techniques. Vacuum 33(10-1): 
803-811, 1983. 

Swaminathan V., Wagner W. R., Anthony P. J., Effect of N-Type and P-Type 
Doping on the Microhardness of GaAs, (Al,Ga)As and Ga(As,Sb) Active Layers 
in 0.82 and 0.87 µm Injection Lasers. J Elchem So 130(12):2468-2472, 1983.

Tarcha P. J., Fitch R. M., Dumais J. J., Jelinski L. W., Particle Morphology of Self-
Hydrolyzed Acrylate Polymer Colloids—A C-13 NMR and DSC Study. J Pol
Sc PP 21(11):2389-2402, 1983.

Teo B. K., A Simple Electron Counting Rule for Close-Packed High Nuclearity 
Metal Clusters. J Chem S CH 1983(22):1362-1364, 1983.

Tonelli A. E., Belfiore L. A., Solution Dipole Moments of Atactic Poly(Para- 
Chlorostyrene) and Poly(Para-Bromostyrene) and Their Dependence on Tem- 
perature and Solvent. Macromolec 16(11):1740-1743, 1983. 


Trimble L. E., Celler G. K., Novel Laser Scanning Techniques for Si-on-Insulator 
Devices. P Soc Photo 385:8-15, 1983. 


Tu C. W., Forrest S. R., Johnston W. D., Epitaxial InP/Fluoride/InP(001) Double 
Heterostructures Grown by Molecular Beam Epitaxy. Appl Phys L 43(6):569- 
571, 1983. 

Vasile M. J., The Reaction Probability of XeF2 With Silicon. J Appl Phys
54(11):6697-6704, 1983. 

Venkatesan T., Wolf T., Allara D., Wilkens B. J., Taylor G. N., Synthesis of Novel 
Inorganic Films by Ion-Beam Irradiation of Polymer Films. Appl Phys L 
43(10):934-936, 1983. 

Werner A., Hochheimer H. D., Strössner K., Jayaraman A., High-Pressure X-Ray
Diffraction Studies on HgTe and HgS to 20 GPa. Phys Rev B 28(6):3330-3334,
1983. 

Wernick J. H., Hull G. W., Geballe T. H., Bernardini J. E., Waszczak J. V., Supercon- 
ductivity in Ternary Heusler Intermetallic Compounds. Mater Lett 2(2):90-92, 
1983. 

Wertheim G. K. et al., Unit Charge on Supported Gold Clusters in Photoemission 
Final State. Phys Rev L 51(25):2310-2313, 1983. 

Wilson B. A., Hu P., Jedju T. M., Harbison J. P., Subnanosecond Radiative and 
Nonradiative Processes in Alpha-Si-H. Phys Rev B 28(10):5901-5907, 1983.

Wilson R. J., Petroff P. M., Cryopump and Sample Holder for Clean Recon-
structed Surface Observations in a Transmission Electron Microscope. Rev
Sci Ins 54(11):1534-1537, 1983.

Wood D. L., Rabinovich E. M., Johnson D. W., Macchesney J. B., Vogel E. M., 
Preparation of High-Silica Glasses from Colloidal Gels. 3. Infrared Spectro- 
photometric Studies. J Am Ceram 66(10):693-699, 1983. 

Yachandra V., Powers L., Spiro T. G., X-Ray Absorption Spectra and the Coordi-
nation Number of Zn and Co Carbonic Anhydrase as a Function of pH and
Inhibitor Binding. J Am Chem S 105(22):6596-6604, 1983.

Yafet Y., Conduction Electron-Spin Relaxation in the Superconducting 
State. Phys Lett A 98(5-6):287-290, 1983. 


SOCIAL AND LIFE SCIENCES

Scott T. W. et al., Subunit Heterogeneity in the Structure and Dynamics of
Hemoglobin—A Transient Raman Study. FEBS Letter 158(1):68-72, 1983.


Wieland S. J., Gelperin A., Dopamine Elicits Feeding Motor Program in Limax 
maximus. J Neurosc 3(9):1735-1745, 1983. 


SPEECH/ACOUSTICS 


Duttweiler D. L., Subsampling to Estimate Delay with Application to Echo 
Canceling. IEEE Acoust 31(5):1090-1099, 1983. 


PAPERS BY AT&T BELL LABORATORIES AUTHORS 685 


CONTENTS, MAY-JUNE 1984 


Heavy-Traffic Approximations for Service Systems With Blocking 
W. Whitt 


Blocking Probability in a Switching Center With Arbitrary Routing 
Policy 
B. Gopinath, J.-M. Garcia, and P. Varaiya 


A Vector Quantizer Combining Energy and LPC Parameters and Its 
Application to Isolated Word Recognition 
L. R. Rabiner, M. M. Sondhi, and S. E. Levinson 


Generic Approaches to the Design of Network Service Circuits 
M. Malek-Zavarei, M. C. Chow, and J. D. Williams 


1980 Bell System Noise Survey of the Loop Plant 
D. V. Batorsky and M. E. Burke 


On the Accuracy of Forecasting Telephone Usage Demand 
M. N. Youssef 













AT&T BELL LABORATORIES TECHNICAL JOURNAL is abstracted or indexed by 
Abstract Journal in Earthquake Engineering, Applied Mechanics Review, Applied
Science & Technology Index, Chemical Abstracts, Computer Abstracts, Current
Contents/Engineering, Technology & Applied Sciences, Current Index to Statistics,
Current Papers in Electrical & Electronic Engineering, Current Papers on Computers &
Control, Electronics & Communications Abstracts Journal, The Engineering Index,
International Aerospace Abstracts, Journal of Current Laser Abstracts, Language and
Language Behavior Abstracts, Mathematical Reviews, Science Abstracts (Series A,
Physics Abstracts; Series B, Electrical and Electronic Abstracts; and Series C, Computer 
& Control Abstracts), Science Citation Index, Sociological Abstracts, Social Welfare, 
Social Planning and Social Development, and Solid State Abstracts Journal. 
Reproductions of the Journal by years are available in microform from University 
Microfilms, 300 N. Zeeb Road, Ann Arbor, Michigan 48106. 




Bell Laboratories 


